Links to Reference Documents on Search (lucene and Solr)
- Lucene/Solr Web resources (http://wiki.apache.org/lucene-java/InformationRetrieval)
- Foundations of Statistical Natural Language Processing (http://nlp.stanford.edu/fsnlp/)
- Solr Reference Guide (https://cwiki.apache.org/confluence/display/solr/Getting+Started)
- Mike McCandless (http://blog.mikemccandless.com/) – Blog on Lucene/Solr – Changing Bits
- Solr/Lucene in 5 min – (http://www.lucenetutorial.com/lucene-in-5-minutes.html)
- Free Text site (http://www.gutenberg.org/ebooks/84?msg=welcome_stranger)
- UIMA (http://uima.apache.org/)
- UIMA Wiki (https://cwiki.apache.org/confluence/display/UIMA/Index)
- Apache UIMA Solrcas (https://uima.apache.org/d/uima-addons-current/Solrcas/SolrcasUserGuide.html)
- Solr 4 UIMA Tutorial (http://wiki.apache.org/solr/Solr4UIMA)
-
Sentiment Analysis and Visualization using UIMA and Solr (https://www.dropbox.com/s/4f1qalmvy8xg8kf/sentiment-analysis-visualization-using-uima-solr.pdf)
- Site with useful slides – (http://www.lucidworks.com/search/catalog?f[talkType][]=User+Case+Study+%28How+we+use+Lucene%2FSolr%29&sort=lastModified+desc)
- Google Blog on Helping computers understnad language (http://googleblog.blogspot.com/2010/01/helping-computers-understand-language.html)
- Slide share site – (number of different slides on Solr/Lucene/UIMA) (http://www.slideshare.net/erikhatcher/lucene-for-solr-developers-10446864)
- Blogs
- (old but some interesting information) http://juanggrande.wordpress.com/
- Searchhub and their blog (http://searchhub.org)
Tools:
- Luke – Solr/Lucene index visualizer ( http://code.google.com/p/luke/ , https://github.com/DmitryKey/luke)
- solrmeter for stress testing over Solr (http://code.google.com/p/solrmeter/)
Solr/Lucence Implementation Specific Reference
- SolrInputDocument and dynamic field issue ticket gives a good insight into how it works (https://issues.apache.org/jira/browse/SOLR-1357)
- Article on solr query http://www.ibm.com/developerworks/java/library/j-solr-lucene/index.html
- Solr Tuning http://www.appneta.com/blog/solr-query-performance-tuning/
- Solr Cloud Old Reference Data which is better doc
- SolrCloud: Searching Big Data (slideshare)
Solr Performance Benhmarking
- http://lucidworks.com/blog/benchmarking-the-new-solr-near-realtime-improvements/
- Tune Performance Solr (slideshare) — Use Deep paging and cursorMark parameter to get speed instead of bulk/classic paging strategy
- Benchmarking Solr Performance at Scale (slideshare)
Solr Cloud
UIMA Reference:
- UIMA Dictionary Annotator Documentation (https://uima.apache.org/d/uima-addons-current/DictionaryAnnotator/DictionaryAnnotatorUserGuide.html)
- UIMA References (http://uima.apache.org/downloads/releaseDocs/2.1.0-incubating/docs/html/references/references.html)
- Apache UIMA/FIT ( https://uima.apache.org/uimafit.html)
- uima setup http://uima.apache.org/downloads/releaseDocs/2.2.0-incubating/docs/html/overview_and_setup/overview_and_setup.html
- IBM Document http://public.dhe.ibm.com/software/dw/data/uima/UIMA_SDK_Users_Guide_Reference.pdf
- Google Code http://code.google.com/p/uimafit/ (This is old – do not use)