sing1ee / analyzer-solr
Analyzer adapter for Solr 5. We support Jieba now, and Stanford in the future.
License: MIT License
@sing1ee, why does each result come back duplicated four times? See the screenshot below. Thanks.
https://drive.google.com/file/d/0B3n0L-fAmNEXY08xWFpsZ0Rzanc/view?usp=sharing
Jars used:
analyzer-solr-1.0.jar
jieba-analysis-1.0.2.jar
Is the Solr configuration file set up according to the specified configuration?
Querying in Solr fails with java.lang.AbstractMethodError.
I'm using the Jieba analyzer to index Chinese text in Solr. Segmentation works fine when I test it on the Analysis page of the Solr Admin UI.
However, when I try highlighting in Solr, the highlight does not land in the correct place. For example, when I search for 自然环境与企业本身, it highlights 认为自然环境与企业本身的.
Even when I search for the English word responsibility, the highlight around responsibility is misplaced.
I'm using jieba-analysis-1.0.0, Solr 5.2.1 and Lucene 5.1.0.
Regards,
Edwin
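Solr's highlighters place the markup using the start and end offsets that the tokenizer records for each token, so a tokenizer that reports shifted offsets would produce exactly this kind of misplaced highlight. The sketch below only illustrates that mechanism. It does not use Solr or Jieba, and the class name `HighlightOffsets`, the sample sentence, and the shift of 2 are all hypothetical, not taken from this report.

```java
public class HighlightOffsets {
    /** Wraps text[start, end) in <em> tags, the way a highlighter applies a token's offsets. */
    static String highlight(String text, int start, int end) {
        if (start < 0 || end > text.length() || start > end) {
            throw new IllegalArgumentException("offsets out of range: " + start + ".." + end);
        }
        return text.substring(0, start)
                + "<em>" + text.substring(start, end) + "</em>"
                + text.substring(end);
    }

    public static void main(String[] args) {
        String text = "corporate responsibility report";
        // Correct offsets for the token "responsibility" are [10, 24).
        System.out.println(highlight(text, 10, 24));
        // Hypothetical tokenizer bug: both offsets shifted by 2.
        System.out.println(highlight(text, 12, 26));
    }
}
```

Comparing the start and end offsets shown for each token on the Admin UI Analysis page against the real character positions in the input is one way to check whether the tokenizer is the source of the shift.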
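A question: I want to use a custom dictionary with the Jieba tokenizer in Solr. After configuring it, it has no effect. Where might the problem be? @sing1ee
Solr 5.3.1
FieldType used on the Analysis page: text_jieba
This message appears:
java.lang.NoClassDefFoundError: com/huaban/analysis/jieba/WordDictionary
How can this be resolved?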
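A `NoClassDefFoundError` means the JVM could not load a class at run time that the calling code was compiled against, so a first step is to confirm that the jieba-analysis jar is actually visible to the class loader. The following is a minimal stand-alone sketch, not part of analyzer-solr; the class name `ClasspathCheck` is hypothetical. It only checks the classpath it is run with, which is an approximation, because Solr loads plugin jars through its own per-core class loader.

```java
import java.security.CodeSource;

public class ClasspathCheck {
    /** Returns where the class was loaded from, or null if it cannot be loaded. */
    static String locate(String className) {
        try {
            // initialize=false: only locate the class, do not run static initializers
            Class<?> cls = Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            // Classes from the JDK itself typically report no code source
            return src == null ? "(JDK runtime)" : String.valueOf(src.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return null;
        }
    }

    public static void main(String[] args) {
        String name = args.length > 0 ? args[0] : "com.huaban.analysis.jieba.WordDictionary";
        String where = locate(name);
        System.out.println(where == null
                ? name + " is NOT visible to this class loader"
                : name + " loaded from " + where);
    }
}
```

Run it with the same jars on `-cp` that were copied into Solr. If the class is not found, the jieba-analysis jar is probably missing from the directory Solr loads plugin jars from.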
@sing1ee Hello, does the Jieba adapter support Solr 6 yet? If not, is there an estimate for when Solr 6 will be supported?
Thanks,
老楊
What is the difference between these two configurations?
analyzer.solr5.stanford.StanfordTokenizerFactory
analyzer.solr5.jieba.JiebaTokenizerFactory
I want to add a custom dictionary for Jieba. How do I set it up? Thanks.
Does the current version support Solr 6.10?
Version 6.10 also seems to have dropped configuration through schema.xml?
@sing1ee Hello, I downloaded your analyzer-solr and jieba-analysis-1.0.2.jar (https://github.com/huaban/jieba-analysis) and am using them with Solr 5.5.2. My configuration is as follows:
<fieldType name="text_jieba" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="analyzer.solr5.jieba.JiebaTokenizerFactory" segMode="SEARCH"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SnowballPorterFilterFactory" language="English"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="analyzer.solr5.jieba.JiebaTokenizerFactory" segMode="SEARCH"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SnowballPorterFilterFactory" language="English"/>
  </analyzer>
</fieldType>
The error message is as follows:
2016/8/11 下午3:10:33 ERROR null HttpSolrCall null:java.lang.RuntimeException: java.lang.NoSuchFieldError: word
null:java.lang.RuntimeException: java.lang.NoSuchFieldError: word
at org.apache.solr.servlet.HttpSolrCall.sendError(HttpSolrCall.java:607)
at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:475)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:257)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:208)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:192)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:165)
at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:198)
at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:108)
at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:140)
.................
Caused by: java.lang.NoSuchFieldError: word
at analyzer.solr5.jieba.JiebaTokenizer.incrementToken(JiebaTokenizer.java:38)
at org.apache.solr.handler.AnalysisRequestHandlerBase.analyzeTokenStream(AnalysisRequestHandlerBase.java:188)
at org.apache.solr.handler.AnalysisRequestHandlerBase.analyzeValue(AnalysisRequestHandlerBase.java:127)
at org.apache.solr.handler.FieldAnalysisRequestHandler.analyzeValues(FieldAnalysisRequestHandler.java:220)
at org.apache.solr.handler.FieldAnalysisRequestHandler.handleAnalysisRequest(FieldAnalysisRequestHandler.java:181)
.............
Where did I go wrong in the configuration, and how can I fix it? Thanks!
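A `NoSuchFieldError` is raised when compiled code refers to a field that the class loaded at run time does not have. The JVM resolves fields by name and type together, so a field that still exists but changed type between library versions triggers it as well. One plausible cause here, stated as an assumption rather than a confirmed diagnosis, is that analyzer-solr was compiled against a different jieba-analysis version than the 1.0.2 jar in use. The sketch below uses plain reflection to print the type of a public field. The class name `FieldCheck` is hypothetical, and `com.huaban.analysis.jieba.SegToken` is assumed to be the class whose `word` field the tokenizer reads, since the stack trace names only the field.

```java
import java.lang.reflect.Field;

public class FieldCheck {
    /** Returns the declared type of a public field, or null if the class or field is absent. */
    static String fieldType(String className, String fieldName) {
        try {
            Class<?> cls = Class.forName(className, false, FieldCheck.class.getClassLoader());
            Field field = cls.getField(fieldName);
            return field.getType().getName();
        } catch (ClassNotFoundException | NoSuchFieldException | LinkageError e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // Defaults are assumptions for this issue; pass other names as arguments.
        String cls = args.length > 0 ? args[0] : "com.huaban.analysis.jieba.SegToken";
        String fld = args.length > 1 ? args[1] : "word";
        String type = fieldType(cls, fld);
        System.out.println(type == null
                ? cls + "." + fld + " not found on this classpath"
                : cls + "." + fld + " has type " + type);
    }
}
```

Running it once per jieba-analysis jar version on `-cp` and comparing the printed types would show whether the field differs between versions.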
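I don't know how to configure the Jieba tokenizer in Solr. Is there any documentation? Thanks!
For example, where should the dictionary files be placed? Thanks!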
Hello, I am an engineer at Bloomberg L.P. May we use this code? If so, could you please upload a license file? Thanks.