- 😄 Welcome to my blog Mageseの飛行器
magese / ik-analyzer-solr Goto Github PK
View Code? Open in Web Editor NEWik-analyzer for solr 7.x-8.x
License: Other
ik-analyzer for solr 7.x-8.x
License: Other
ES7 能直接用吗?
能不能通过配置来实现禁止使用原始词库的需求?
请问分词结果怎么过滤单个字符呢?如果源词就只有一个字符那么就直接返回源词,如果原来的词是多个字符例如 “我是**人”, 那么分词结果只保留 “我是**人”, “我是”,“**人”, “**”,不再要“人”
我在网上查配置 isMaxWordLength="false"
貌似不生效
作者大大您好:
我使用ik-analyzer-solr添加词语分词正常,但是有些业务是需要把这些词删除的.
我发现删除dynamicdic.txt中的一些词语之后,分词还是会有.重启solr之后才会有效.
请问一下作者 有么什么好的办法 不重启自动加载的 谢谢
Solr 8.5.2 单机环境,
不启用用户主词典的情况下。 扩展词典有效果。
启用用户主词典的情况下。 扩展词典好像没有效果, 是不是优先级没有主词典高。 别忽略了。
用户扩展词典是不是第一优先的呢?
好像停用词也不起作用。
請問如何支援同義字搜尋?
如题,谢谢
用的lucene8.0,没有使用solr。谢谢。
例如,1200万吨/年催化裂化装置,这个词中想让/不被过滤掉应该怎么处理,加到扩展词中没有起作用
Solr 9 出来了,大神加油!
能否不同的index用不同的词库
support solr8
RT
最开始我看的博客也是星火燎原,https://www.cnblogs.com/liang1101/articles/6395016.html,
一直报空指针异常,换了大佬这个包之后还是在这一句代码处报异常,dynamicdic文件是不能为空么,ik我也放在class文件下了,但是仍然解析不了这个文件,异常报错如下,请大佬指教。
IKTokenizerFactory 1081633527 inform conf: ik.conf
parsing ik.conf NullPointerException!!![org.apache.solr.core.SolrResourceLoader.openResource(SolrResourceLoader.java:407), org.wltea.analyzer.lucene.IKTokenizerFactory.canUpdate(IKTokenizerFactory.java:124), org.wltea.analyzer.lucene.IKTokenizerFactory.update(IKTokenizerFactory.java:98), org.wltea.analyzer.lucene.IKTokenizerFactory.inform(IKTokenizerFactory.java:79), org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:720), org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:176), org.apache.solr.schema.ManagedIndexSchema.<init>(ManagedIndexSchema.java:105), org.apache.solr.schema.ManagedIndexSchemaFactory.create(ManagedIndexSchemaFactory.java:173), org.apache.solr.schema.ManagedIndexSchemaFactory.create(ManagedIndexSchemaFactory.java:45), org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:75), org.apache.solr.core.ConfigSetService.createIndexSchema(ConfigSetService.java:119), org.apache.solr.core.ConfigSetService.getConfig(ConfigSetService.java:92), org.apache.solr.core.CoreContainer.getConfigSet(CoreContainer.java:1073), org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1025), org.apache.solr.core.CoreContainer.lambda$load$13(CoreContainer.java:642), com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:197), java.util.concurrent.FutureTask.run(FutureTask.java:266), org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188), java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142), java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617), java.lang.Thread.run(Thread.java:748)]
Eager to see ik-analyzer for the Solr releases beyond 8.4.
Thanks a great deal.
solr7.7.3 在使用ik分词器的扩展词时,比如我扩展词配置了一个词 测试产品,当我输入 测试产品 时,ik分词器会将这个测试产品分成 测试产品 、测试、产品等,我只需要测试产品即可,这个要怎么实现?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.