Apache Solr - 5.0 and beyond

Apache Solr - 5.0 and beyond
Anshum Gupta
Apache Lucene/Solr PMC Member and Committer
Who am I?
•
Anshum Gupta, Apache Lucene/Solr PMC member
and committer, Lucidworks Employee.
•
Interested in search and related stuff.
•
Apache Lucene since 2006 and Solr since 2010.
•
Organizations I am or have been a part of:
What is Lucene?
•
Apache Lucene is a free open source information
retrieval software library
•
Originally written in Java by Doug Cutting.
•
It is supported by the Apache Software Foundation
and is released under the Apache Software
License.
What is Solr?
•
Solr (pronounced "solar") is an open source
enterprise search platform
•
Written in Java,
•
For a while now, a part of the Apache Lucene
project.
•
Search on Lucene - Replicated (SoLR)
•
SolrCloud - Distributed feature set
Apache Solr is the most widely-used search
solution on the planet.
You use
everyday.
Solr has tens of thousands of
applications in production.
Solr is both established
and growing.
8,000,000+
Total downloads
250,000+
Monthly downloads
2,500+
Open Solr jobs and the largest
community of developers.
Apache Solr is also one of the most active open
source projects out there
Activity statistics
30 Day Summary
Mar 14 2015 — Apr 13 2015
160 Commits
23 Contributors
Annual commits up
12 Month Summary
Apr 13 2014 — Apr 13 2015
1440 Commits
31 Contributors
+126 (9%)
via https://www.openhub.net/p/solr
Solr Feature Release
Frequency
Solr Essentials
•
Search - Full text, Geo-spatial
•
Faceting - Values, Ranges, Pivots, etc.
•
Suggestor, highlighting, auto-complete
•
Pluggability
•
and of course, Speed and Scalability
What’sTitle
new Text
in Solr 5x?
Ease of Use
•
Get started in < 5 minutes
•
APIs, and more APIs
•
Schema
•
Config
•
Collections
•
Auto* - Failover, leader election, addition of replica!
•
One of the best official documentation, released almost with
the code.
Scalability and Performance
•
Thousands of collections - Apple
•
Billions of Documents - Box
•
High throughput and near real time Bloomberg
•
Impressive indexing performance: 150 k docs/
sec per node
Solr Scalability is unmatched
Reliability
•
Tons of tests and quality code
•
Critical systems running in production
•
Jepsen tests - Proven again!
•
Independent benchmarking and testing
Features and more!
•
Analytics - Do more with your data!
•
Distributed IDF
•
It’s an app not a war!
Solr News
What’s coming?
•
Scalability
•
Faster search - SOLR-6810
•
Improved indexing - SOLR-6816
•
Analytics - HyperLogLog - SOLR-6968
•
Security - Authentication and Authorization
framework - SOLR-7230
•
And tons more!
The largest Lucene/Solr conference in the world
OCT 13 - 16, 2015
AUSTIN, TX
CFP is open until May 8, 2015
For more details visit:
http://lucenerevolution.org
Connect @
http://www.twitter.com/anshumgupta
http://www.linkedin.com/in/anshumgupta/
[email protected]