Hadoop Presentation
Globalcode – Open4education
Track – .NET Architecture
José Renato Pequeno
Senior Systems Specialist - GFT
Introduction
• 25 years working in software development
• TDC 2011 – SOA track
• TDC 2012 – iOS and Analysis 2.0 tracks
• TDC 2013 – Scala track
• TDC 2014 – .NET Architecture, Big Data and HPC tracks
Big Data
What is it?
Big Data is
… like teenage sex:
• everyone talks about it;
• nobody really knows how to do it;
• everyone thinks their friends are doing it;
• everyone claims to be doing it.
• In the last two years, we have created 90% of all the data available in the world.
• Today we generate close to 15 petabytes in a single day, which is equivalent to the sum of every word spoken since the beginning of time.
Source: "A nova era da computação", Freddy Vaquero, VP of Systems and Technology, IBM Brasil
http://www.ibm.com/midmarket/br/pt/articles_nova_era_computacao.html
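To put the 15 petabytes per day figure in perspective, it can be converted to an average sustained throughput. The sketch below is only a back-of-the-envelope calculation: it assumes decimal units (1 PB = 10^15 bytes), which the source does not specify, and the function name is illustrative.

```python
# Assumption: decimal units (1 PB = 10**15 bytes); the quoted source
# does not say which convention it uses.
PETABYTE = 10**15
SECONDS_PER_DAY = 24 * 60 * 60


def daily_volume_to_rate(petabytes_per_day: float) -> float:
    """Return the average throughput in bytes per second."""
    return petabytes_per_day * PETABYTE / SECONDS_PER_DAY


rate = daily_volume_to_rate(15)
print(f"{rate / 10**9:.1f} GB/s")  # prints "173.6 GB/s"
```

In other words, under that assumption the quoted volume corresponds to roughly 174 gigabytes generated every second, on average.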
How to get started?
Main Hadoop Vendors
• Cloudera
• MapR
• Hortonworks
Cloudera Distributions
Cloudera Enterprise
Designed specifically for mission-critical environments, Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. Cloudera is your partner on the path to big data.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise.html
Cloudera Express
The Best Way to Get Started with Hadoop
Cloudera Express is a free download that combines CDH, Cloudera’s 100% open source and enterprise-ready distribution of Apache Hadoop with Cloudera Manager, which provides robust cluster management capabilities like automated deployment, centralized administration, monitoring, and diagnostic tools.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-express.html
Cloudera Manager
End-to-End Administration for Hadoop
Cloudera Manager is the industry’s first and most sophisticated management application for Apache Hadoop and the enterprise data hub. Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of the data hub — empowering operators to improve performance, enhance quality of service, increase compliance and reduce administrative costs.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager.html
CDH
100% Open Source Distribution including Apache Hadoop
CDH is the world’s most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cdh.html
MapR Distributions
M3 Standard Edition
MapR M3 is a complete distribution for Apache™ Hadoop®
MapR M3 Standard Edition is a free and complete distribution for Apache™ Hadoop® that includes Apache HBase™, Apache Pig, Apache Hive, Apache Mahout, Cascading, Apache Sqoop, Apache Flume, and more. MapR M3 provides the capabilities for entry-level Hadoop users to develop Big Data applications using the complete Apache Hadoop stack while providing easy management, seamless interoperability and high performance.
Source: http://www.mapr.com/sites/default/files/mapr020_datasheet_m3.pdf
M5 Enterprise Edition
MapR M5 is a complete enterprise-grade distribution for Apache™ Hadoop®
MapR M5 includes Apache HBase™, Apache Pig, Apache Hive, Apache Mahout, Cascading, Apache Sqoop, Apache Flume and more. MapR M5 not only provides advanced high availability (HA) and data protection features such as Self Healing HA, JobTracker HA, Snapshots and Mirroring, but also enables unprecedented Hadoop access and management capabilities through industry standard interfaces such as NFS and ODBC. MapR M5, available on a subscription basis, is fully supported for the most demanding mission-critical deployments.
Source: http://www.mapr.com/sites/default/files/mapr020_datasheet_m5.pdf
M7 Enterprise DataBase Edition
MapR M7 offers all the powerful features of MapR M5 Enterprise Edition, and also includes Apache projects such as Apache HBase, Apache Pig, Apache Hive™, Apache Mahout™, Cascading, Apache Sqoop™, Apache Flume™, and more.
Source: http://www.mapr.com/sites/default/files/mapr_datasheet_m7.pdf
MapR Sandbox
The MapR Sandbox for Hadoop provides tutorials, demo applications, and browser-based user interfaces to let developers and administrators get started quickly with Hadoop. It is a fully functional Hadoop cluster running in a virtual machine. You can try our Sandbox now - it is completely free and available as a VMware or VirtualBox VM.
Source: http://www.mapr.com/products/mapr-sandbox-hadoop
Hortonworks Data Platform
Architected, developed and built completely in the open, Hortonworks Data Platform (HDP) is designed to meet the changing needs of enterprise data processing.
HDP is fundamentally versatile, providing linear, scalable storage and compute across a wide range of access methods, from batch and interactive to real time, search and streaming. It includes a comprehensive set of the essential data capabilities required by the modern enterprise across governance, integration, security and operations.
Source: http://br.hortonworks.com/
Hortonworks Data SandBox
The easiest way to get started with Enterprise Hadoop
Sandbox is a personal, portable Hadoop environment that comes with a dozen interactive Hadoop tutorials. Sandbox includes many of the most exciting developments from the latest HDP distribution, packaged up in a virtual environment that you can get up and running in 15 minutes!
Source: http://br.hortonworks.com/products/hortonworks-sandbox/
Main Big Data Players
• IBM InfoSphere: Cloudera
• Oracle Big Data: Cloudera
• EMC Greenplum: MapR
• Teradata: Hortonworks
• Microsoft HDInsight: Hortonworks
The End
Links:
• http://br.hortonworks.com/
• https://www.ibm.com/developerworks/data/library/techarticle/dm-1210bigdatasecurity/
• http://www.teradata.com/business-needs/Big-Data-Analytics/?ICID=Sbda#tabbable=0&tab1=0&tab2=0&tab3=0&tab4=0
• http://www.mapr.com/products/mapr-sandbox-hadoop
• http://www.oracle.com/us/products/database/big-data-appliance/overview/index.html
• http://www.cloudera.com/content/cloudera/en/home.html