Building Search Engines - Lucene, SolR and Elasticsearch

  • www.anant.us | [email protected] | 202.905.2818 | 1010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007

    Research & Development

    Comparing Lucene / SolR / Elastic & Cloud Search Providers

    Building Search Engines

  • What do we do?

    Streamline, Organize & Unify Business Information

  • Agenda

    Challenge - Why does this matter?

    Info Retrieval - Retrieval / Routing

    Lucene - More than meets the eye ...

    Search Engine - 30k Foot View

    On Premise - Lucene / SolR / Elastic

    Cloud Providers - Amazon / Azure

  • Challenge - Why does this matter?

    Knowledge

    Project Information

    Client Service Information

    Corporate Guides

    Collaborative Documents

    Assets & Files

    Corporate Resources

    Appleseed Framework (Portal, Base, Search)

    G Drive Delta

    Dropbox

    G Drive Delta

    Nutshell / Dropbox

    Freshbooks

    G Drive / G Sites (KB)

    G Drive / Workflowy

    Evernote

    G Drive / Dropbox

    OwnCloud

    Pocket / Leaves

    AIC (WP) / Anant (WP)

  • Information Retrieval

    Document Retrieval

    Google Search

    Amazon Search

    LinkedIn Search

    CMS Search * Portal Search * CRM Search * Search *

    Document Routing

    Google Alerts

    Amazon Recommendations

    Netflix Recommendations

    LinkedIn Recommendations

  • Lucene - Inverted Index
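
The slide names the inverted index without spelling it out. As a rough illustration only (this is not Lucene's actual implementation, and the function names and sample documents below are made up for the example), an inverted index maps each term to the documents that contain it, so a query becomes a lookup and an intersection instead of a scan over every document:

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(docs):
    """Map each term to the sorted list of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def search(index, query):
    """Return ids of documents that contain every query term (AND search)."""
    terms = tokenize(query)
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Hypothetical sample documents, keyed by document id.
docs = {
    1: "Lucene is a search library",
    2: "SolR is a search server built on Lucene",
    3: "Elasticsearch is a search server",
}
index = build_inverted_index(docs)
```

With these sample documents, `search(index, "lucene server")` only has to intersect the posting lists for "lucene" and "server". A real engine layers scoring, positions, and compressed on-disk segments on top of this basic structure.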

  • Lucene - More than meets the eye

    Who Next?

    Think of it like a NoSQL database that has great indexing... everywhere.

  • Search Engine - 30 Thousand Foot View

    The search index is only as good as your processed data. If you put everything you find in your index, you are going to spend a lot of time telling people how to search.
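
One way to act on this point is to filter and normalize records before they ever reach the index. The sketch below is an assumption about what "processed data" could mean in practice, not something the deck prescribes: the field names (`id`, `title`, `body`) and the function `prepare_for_index` are hypothetical.

```python
def prepare_for_index(records, required=("title", "body")):
    """Keep only records worth indexing and normalize their text fields."""
    seen = set()
    prepared = []
    for record in records:
        # Collapse runs of whitespace; missing or None fields become "".
        cleaned = {
            field: " ".join(str(record.get(field) or "").split())
            for field in required
        }
        if not all(cleaned.values()):
            continue  # skip records with an empty required field
        key = tuple(value.lower() for value in cleaned.values())
        if key in seen:
            continue  # skip case-insensitive duplicates
        seen.add(key)
        prepared.append({"id": record["id"], **cleaned})
    return prepared


# Hypothetical raw records pulled from different sources.
raw = [
    {"id": 1, "title": "Guide ", "body": "How  to search"},
    {"id": 2, "title": "Guide", "body": "how to search"},  # duplicate of 1
    {"id": 3, "title": "", "body": "no title"},            # empty title
    {"id": 4, "title": "Notes", "body": None},             # missing body
]
prepared = prepare_for_index(raw)
```

Only the first record survives here. Which fields are required and what counts as a duplicate are design choices that depend on the content being indexed.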

  • On Premise - Lucene / ES / SolR

    Lucene Library

    File System Format

    Fast

    Embeddable*

    Indexing Anywhere

    Need to really know Lucene

    No Interface

    No server

    Lots of housekeeping

    SolR Server

    Admin / REST Interface

    Configurable

    Scalable

    Great at Text*

    Truly Open

    10+ Years

    Good ecosystem

    Too customizable

    Schemas*

    Zookeeper Needed

    ElasticSearch Server

    Configurable

    Scalable

    Good ecosystem

    Built in Clustering

    Grouping / Filtering

    Great for Logs

    Started as a Cloud Tool

    No great OTS Interface

    Only REST Interface
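
Both servers in the comparison above are queried over HTTP. To make that concrete, here is a minimal sketch that only builds the requests and does not send them. It assumes Solr's standard `/select` handler and Elasticsearch's `_search` endpoint with a `match` query; the host, the `docs` collection and index name, and the `body` field are placeholders, not values from the deck.

```python
import json
from urllib.parse import urlencode


def solr_select_url(base_url, collection, text, rows=10):
    """Build a GET URL for Solr's /select request handler."""
    params = urlencode({"q": text, "rows": rows, "wt": "json"})
    return f"{base_url}/solr/{collection}/select?{params}"


def elasticsearch_search_request(base_url, index, field, text, size=10):
    """Build the URL and JSON body for an Elasticsearch match query."""
    url = f"{base_url}/{index}/_search"
    body = json.dumps({"query": {"match": {field: text}}, "size": size})
    return url, body


# Placeholder hosts on each server's default port.
solr_url = solr_select_url("http://localhost:8983", "docs", "body:lucene", rows=5)
es_url, es_body = elasticsearch_search_request(
    "http://localhost:9200", "docs", "body", "lucene", size=5
)
```

The Solr query travels as URL parameters, while the Elasticsearch query is a JSON document sent in the request body. Keeping request construction separate from sending makes both easy to inspect and test.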

  • Cloud Search - Amazon / Azure

    Amazon SolRCloud*

    AWS* Ecosystem

    5 QParsers

    Dynamic Fields

    100% Completely Managed

    Been Around for a While

    Data / Read Writes

    No nested Objects

    Azure ElasticSearch*

    Azure* Ecosystem

    2 QParsers

    100% Completely Managed

    Good SDK

    Few Years Old

    Data / Read Writes

    No nested Objects

    Not so Dynamic Fields
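
Both lists above include "No nested Objects". One common workaround, assumed here rather than taken from the deck, is to flatten nested records into flat field names before upload. The underscore separator and the sample record are illustrative only; each service has its own field-naming rules, which this sketch does not check.

```python
def flatten(record, parent="", sep="_"):
    """Flatten nested dicts into one level, joining keys with `sep`."""
    flat = {}
    for key, value in record.items():
        name = f"{parent}{sep}{key}" if parent else key
        if isinstance(value, dict):
            # Recurse into nested objects, carrying the joined key as prefix.
            flat.update(flatten(value, name, sep))
        else:
            # Scalars and lists are kept unchanged.
            flat[name] = value
    return flat


# Hypothetical nested record.
nested = {
    "id": "7",
    "author": {"name": "Rahul", "org": {"city": "DC"}},
    "tags": ["search"],
}
flat = flatten(nested)
```

Here `author.org.city` becomes the single field `author_org_city`. Lists are left as they are, on the assumption that the target service accepts multi-valued fields.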

  • Questions & Contact

    @anantcorp

    facebook.com/anantCorp

    linkedin.com/company/anant

    [email protected]/in/xingh

    Rahul Singh, CEO & Founder

    Modern Enterprise

    Mastering Services in the Service of Others

    Hybrid Agile Project Management

    Building Search Engines

    CICD / DevOps

    Connecting Internet Software

  • Streamlined Data - Integration / Data Pipelines

    Organized Knowledge - Search / Data Warehouses

    Unified Interfaces - Portals / Dashboards / Mobile