

APACHE LUCENE SOLR FULL
Prior to Box, Shubhro worked on full text database search at Oracle. Currently he is part of the Search team at Box, building infrastructure components that enable millions of users to find relevant content. Shubhro enjoys working with data at scale, be it indexing, mining or analyzing it. Come take a peek under the hood of Box Search. We will share our learnings from various issues we have faced running Solr at scale and how we have address them by building additional scaffolding or tweaking Solr itself. But what does availability really mean for Search and how do we measure it? Once measured how do we ensure a multi-cluster deployment of Apache Solr with terabyte scale sharded inverted index hits the holy grail of 4 9s of availability ? How do we automatically detect failures with such systems and what are our options to handle and recover from such failures without human intervention ? In this talk we will discuss various architectural choices and deployment strategies we have adopted at Box to improve availability of search while supporting high-throughput, near real-time indexing, low latency and multi-tenancy. Improving Search Availability: Striving for more 9sĪvailability is a critical aspect of any distributed system, especially when your customer's mission critical applications depend on it. When not working, he enjoys photography, dogs, photography of dogs.

APACHE LUCENE SOLR SOFTWARE
He is a PMC member and committer on several Apache projects, and strongly believes that when people develop breadth in their expertise it builds better software all around. Topics covered would include query patterns, indexing patterns, and shard design.Īn engineer with over a decade of distributed systems experience, Mike has spent most of his career helping enable others who are using big data platforms. We will draw on stories from the presenter's operational experience and distill the events into easy to understand patterns and anti-patterns. There are many pitfalls that a team can fall into when designing and implementing a new Solr-based search application.
