{"id":355,"date":"2014-07-23T14:42:41","date_gmt":"2014-07-23T14:42:41","guid":{"rendered":"http:\/\/shirishranjit.com\/blog1\/?page_id=355"},"modified":"2016-04-22T08:57:44","modified_gmt":"2016-04-22T12:57:44","slug":"references","status":"publish","type":"page","link":"https:\/\/shirishranjit.com\/blog1\/big-data\/apache-and-bigdata\/references","title":{"rendered":"Machine Learning, NLP, and Search Engine References"},"content":{"rendered":"<h2 style=\"font-weight: normal;\">Links to Reference Documents on Search (lucene and Solr)<\/h2>\n<ul>\n<li>Lucene\/Solr Web resources (<a style=\"color: #3b73af !important;\" href=\"http:\/\/wiki.apache.org\/lucene-java\/InformationRetrieval\">http:\/\/wiki.apache.org\/lucene-java\/InformationRetrieval<\/a>)<\/li>\n<li>Foundations of Statistical Natural Language Processing<a style=\"color: #3b73af !important;\" href=\"http:\/\/nlp.stanford.edu\/fsnlp\/\">\u00a0(http:\/\/nlp.stanford.edu\/fsnlp\/<\/a>)<\/li>\n<li><a style=\"color: #3b73af !important;\" href=\"https:\/\/cwiki.apache.org\/confluence\/display\/solr\/Getting+Started\">Solr Reference Guide (https:\/\/cwiki.apache.org\/confluence\/display\/solr\/Getting+Started<\/a>)<\/li>\n<li><a style=\"color: #3b73af !important;\" href=\"http:\/\/blog.mikemccandless.com\/\">Mike McCandless (http:\/\/blog.mikemccandless.com\/<\/a>)\u00a0 &#8211; Blog on Lucene\/Solr &#8211; Changing Bits<\/li>\n<li>Solr\/Lucene in 5 min &#8211; (<a style=\"color: #3b73af !important;\" href=\"http:\/\/www.lucenetutorial.com\/lucene-in-5-minutes.html\">http:\/\/www.lucenetutorial.com\/lucene-in-5-minutes.html<\/a>)<\/li>\n<li>Free Text site (<a style=\"color: #3b73af !important;\" href=\"http:\/\/www.gutenberg.org\/ebooks\/84?msg=welcome_stranger\">http:\/\/www.gutenberg.org\/ebooks\/84?msg=welcome_stranger<\/a>)<\/li>\n<li>UIMA<a style=\"color: #3b73af !important;\" href=\"http:\/\/uima.apache.org\/\">\u00a0(http:\/\/uima.apache.org\/<\/a>)<\/li>\n<li>UIMA Wiki\u00a0 (<a style=\"color: #3b73af !important;\" href=\"https:\/\/cwiki.apache.org\/confluence\/display\/UIMA\/Index\">https:\/\/cwiki.apache.org\/confluence\/display\/UIMA\/Index<\/a>)<\/li>\n<li>Apache UIMA Solrcas (<a style=\"color: #3b73af !important;\" href=\"https:\/\/uima.apache.org\/d\/uima-addons-current\/Solrcas\/SolrcasUserGuide.html\">https:\/\/uima.apache.org\/d\/uima-addons-current\/Solrcas\/SolrcasUserGuide.html<\/a>)<\/li>\n<li>Solr 4 UIMA Tutorial (<a style=\"color: #3b73af !important;\" href=\"http:\/\/wiki.apache.org\/solr\/Solr4UIMA\">http:\/\/wiki.apache.org\/solr\/Solr4UIMA<\/a>)<\/li>\n<li>\n<p class=\"r\">Sentiment Analysis and Visualization using\u00a0<em>UIMA<\/em>\u00a0and\u00a0<em>Solr<\/em>\u00a0(<a style=\"color: #3b73af !important;\" href=\"https:\/\/www.dropbox.com\/s\/4f1qalmvy8xg8kf\/sentiment-analysis-visualization-using-uima-solr.pdf\">https:\/\/www.dropbox.com\/s\/4f1qalmvy8xg8kf\/sentiment-analysis-visualization-using-uima-solr.pdf<\/a>)<\/p>\n<\/li>\n<li>Site with useful slides &#8211; (<a style=\"color: #3b73af !important;\">http:\/\/www.lucidworks.com\/search\/catalog?f[talkType][]=User+Case+Study+%28How+we+use+Lucene%2FSolr%29&amp;sort=lastModified+desc<\/a>)<\/li>\n<li>Google Blog on Helping computers understnad language (<a style=\"color: #3b73af !important;\" href=\"http:\/\/googleblog.blogspot.com\/2010\/01\/helping-computers-understand-language.html\">http:\/\/googleblog.blogspot.com\/2010\/01\/helping-computers-understand-language.html<\/a>)<\/li>\n<li>Slide share site &#8211; (number of different slides on Solr\/Lucene\/UIMA) (<a style=\"color: #3b73af !important;\" href=\"http:\/\/www.slideshare.net\/erikhatcher\/lucene-for-solr-developers-10446864\">http:\/\/www.slideshare.net\/erikhatcher\/lucene-for-solr-developers-10446864<\/a>)<\/li>\n<li>Blogs\n<ul>\n<li>(old but some interesting information)<a style=\"color: #3b73af !important;\" href=\"http:\/\/juanggrande.wordpress.com\/\">\u00a0http:\/\/juanggrande.wordpress.com\/<\/a><\/li>\n<li>Searchhub and their blog (<a title=\"http:\/\/searchhub.org\" href=\"http:\/\/searchhub.org\">http:\/\/searchhub.org<\/a>)<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>Tools:<\/p>\n<ul>\n<li>Luke &#8211; Solr\/Lucene index visualizer (<a style=\"color: #3b73af !important;\" href=\"http:\/\/code.google.com\/p\/luke\/\">\u00a0<\/a><a style=\"color: #3b73af !important;\" href=\"http:\/\/code.google.com\/p\/luke\/\">http:\/\/code.google.com\/p\/luke\/<\/a>\u00a0,\u00a0<a style=\"color: #3b73af !important;\" href=\"https:\/\/github.com\/DmitryKey\/luke\">https:\/\/github.com\/DmitryKey\/luke<\/a>)<\/li>\n<li>solrmeter for stress testing over Solr\u00a0 (<a style=\"color: #3b73af !important;\" href=\"http:\/\/code.google.com\/p\/solrmeter\/\">http:\/\/code.google.com\/p\/solrmeter\/<\/a>)<\/li>\n<\/ul>\n<p>Solr\/Lucence Implementation Specific Reference<\/p>\n<ul>\n<li>SolrInputDocument and dynamic field issue ticket gives a good insight into how it works (<a style=\"color: #3b73af !important;\" href=\"https:\/\/issues.apache.org\/jira\/browse\/SOLR-1357\">https:\/\/issues.apache.org\/jira\/browse\/SOLR-1357<\/a>)<\/li>\n<li>Article on solr query \u00a0 \u00a0\u00a0<a style=\"color: #3b73af !important;\" href=\"http:\/\/www.ibm.com\/developerworks\/java\/library\/j-solr-lucene\/index.html\">http:\/\/www.ibm.com\/developerworks\/java\/library\/j-solr-lucene\/index.html<\/a><\/li>\n<li>Solr Tuning \u00a0\u00a0<a style=\"color: #3b73af !important;\" href=\"http:\/\/www.appneta.com\/blog\/solr-query-performance-tuning\/\">http:\/\/www.appneta.com\/blog\/solr-query-performance-tuning\/<\/a><\/li>\n<li><a title=\"https:\/\/wiki.apache.org\/solr\/SolrCloud\" href=\"https:\/\/wiki.apache.org\/solr\/SolrCloud\">Solr Cloud Old Reference Data which is better doc<\/a><\/li>\n<li><a title=\"http:\/\/www.slideshare.net\/shalinmangar\/gids2014-solrcloud-searching-big-data?related=2\" href=\"http:\/\/www.slideshare.net\/shalinmangar\/gids2014-solrcloud-searching-big-data?related=2\">SolrCloud: Searching Big Data (slideshare)<\/a><\/li>\n<\/ul>\n<p>Solr Performance Benhmarking<\/p>\n<ul>\n<li><a title=\"http:\/\/lucidworks.com\/blog\/benchmarking-the-new-solr-near-realtime-improvements\/\" href=\"http:\/\/lucidworks.com\/blog\/benchmarking-the-new-solr-near-realtime-improvements\/\">http:\/\/lucidworks.com\/blog\/benchmarking-the-new-solr-near-realtime-improvements\/<\/a><\/li>\n<li><a title=\"http:\/\/www.slideshare.net\/shalinmangar\/high-performance-solr?related=1\" href=\"http:\/\/www.slideshare.net\/shalinmangar\/high-performance-solr?related=1\">Tune Performance Solr (slideshare)<\/a> &#8212; Use Deep paging and cursorMark parameter to get speed instead of bulk\/classic paging strategy<\/li>\n<li><a title=\"http:\/\/www.slideshare.net\/thelabdude\/solr-performance?related=1\" href=\"http:\/\/www.slideshare.net\/thelabdude\/solr-performance?related=1\">Benchmarking Solr Performance at Scale (slideshare)<\/a><\/li>\n<\/ul>\n<p>Solr Cloud<\/p>\n<ul>\n<li><a title=\"http:\/\/lucidworks.com\/blog\/shard-splitting-in-solrcloud\/\" href=\"http:\/\/lucidworks.com\/blog\/shard-splitting-in-solrcloud\/\">Shard Spliting in Solr Cloud<\/a><\/li>\n<li><a title=\"http:\/\/architects.dzone.com\/articles\/apache-solrcloud\" href=\"http:\/\/architects.dzone.com\/articles\/apache-solrcloud\">Solr Cloud and Zookeeper in AWS<\/a><\/li>\n<li><a title=\"http:\/\/harish11g.blogspot.com\/2012\/02\/apache-solr-sharding-amazon-ec2.html\" href=\"http:\/\/harish11g.blogspot.com\/2012\/02\/apache-solr-sharding-amazon-ec2.html\">Sharding blog post<\/a><\/li>\n<li><a title=\"https:\/\/wiki.apache.org\/solr\/SolrTerminology\" href=\"https:\/\/wiki.apache.org\/solr\/SolrTerminology\">Solr Terminology<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>UIMA Reference:<\/p>\n<ul>\n<li>UIMA Dictionary Annotator Documentation (<a style=\"color: #3b73af !important;\" href=\"https:\/\/uima.apache.org\/d\/uima-addons-current\/DictionaryAnnotator\/DictionaryAnnotatorUserGuide.html\">https:\/\/uima.apache.org\/d\/uima-addons-current\/DictionaryAnnotator\/DictionaryAnnotatorUserGuide.html<\/a>)<\/li>\n<li>UIMA References (<a style=\"color: #3b73af !important;\" href=\"http:\/\/uima.apache.org\/downloads\/releaseDocs\/2.1.0-incubating\/docs\/html\/references\/references.html\">http:\/\/uima.apache.org\/downloads\/releaseDocs\/2.1.0-incubating\/docs\/html\/references\/references.html<\/a>)<\/li>\n<li>Apache UIMA\/FIT (\u00a0<a style=\"color: #3b73af !important;\" href=\"https:\/\/uima.apache.org\/uimafit.html\">https:\/\/uima.apache.org\/uimafit.html<\/a>)<\/li>\n<li>uima setup\u00a0<a style=\"color: #3b73af !important;\" href=\"http:\/\/uima.apache.org\/downloads\/releaseDocs\/2.2.0-incubating\/docs\/html\/overview_and_setup\/overview_and_setup.html\">http:\/\/uima.apache.org\/downloads\/releaseDocs\/2.2.0-incubating\/docs\/html\/overview_and_setup\/overview_and_setup.html<\/a><\/li>\n<li>IBM Document\u00a0<a style=\"color: #3b73af !important;\" href=\"http:\/\/public.dhe.ibm.com\/software\/dw\/data\/uima\/UIMA_SDK_Users_Guide_Reference.pdf\">http:\/\/public.dhe.ibm.com\/software\/dw\/data\/uima\/UIMA_SDK_Users_Guide_Reference.pdf<\/a><\/li>\n<li>Google Code\u00a0<a style=\"color: #3b73af !important;\" href=\"http:\/\/code.google.com\/p\/uimafit\/\">http:\/\/code.google.com\/p\/uimafit\/<\/a>\u00a0 (This is old\u00a0\u2013 do not use)<\/li>\n<li><\/li>\n<\/ul>\n<div class=\"twttr_buttons\"><div class=\"twttr_twitter\">\n\t\t\t\t\t<a href=\"http:\/\/twitter.com\/share?text=Machine+Learning%2C+NLP%2C+and+Search+Engine+References\" class=\"twitter-share-button\" data-via=\"\" data-hashtags=\"\"  data-size=\"default\" data-url=\"https:\/\/shirishranjit.com\/blog1\/big-data\/apache-and-bigdata\/references\"  data-related=\"\" target=\"_blank\">Tweet<\/a>\n\t\t\t\t<\/div><div class=\"twttr_followme\">\n\t\t\t\t\t\t<a href=\"https:\/\/twitter.com\/shiranjit\" class=\"twitter-follow-button\" data-size=\"default\"  data-show-screen-name=\"false\"  target=\"_blank\">Follow me<\/a>\n\t\t\t\t\t<\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Links to Reference Documents on Search (lucene and Solr) Lucene\/Solr Web resources (http:\/\/wiki.apache.org\/lucene-java\/InformationRetrieval) Foundations of Statistical Natural Language Processing\u00a0(http:\/\/nlp.stanford.edu\/fsnlp\/) Solr Reference Guide (https:\/\/cwiki.apache.org\/confluence\/display\/solr\/Getting+Started) Mike McCandless (http:\/\/blog.mikemccandless.com\/)\u00a0 &#8211; Blog on Lucene\/Solr &#8211; Changing Bits Solr\/Lucene in 5 min &#8211; (http:\/\/www.lucenetutorial.com\/lucene-in-5-minutes.html) Free &hellip; <a href=\"https:\/\/shirishranjit.com\/blog1\/big-data\/apache-and-bigdata\/references\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":846,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-355","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/pages\/355"}],"collection":[{"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/comments?post=355"}],"version-history":[{"count":17,"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/pages\/355\/revisions"}],"predecessor-version":[{"id":1302,"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/pages\/355\/revisions\/1302"}],"up":[{"embeddable":true,"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/pages\/846"}],"wp:attachment":[{"href":"https:\/\/shirishranjit.com\/blog1\/wp-json\/wp\/v2\/media?parent=355"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}