24
Globalcode – Open4education Trilha – Arquitetura .NET José Renato Pequeno Especialista de Sistemas Sênior - GFT

Apresentação Hadoop

Embed Size (px)

DESCRIPTION

Apresentação Hadoop

Citation preview

Page 1: Apresentação Hadoop

Globalcode – Open4education

Trilha – Arquitetura .NETJosé Renato Pequeno

Especialista de Sistemas Sênior - GFT

Page 2: Apresentação Hadoop

Globalcode – Open4education

Apresentação

• 25 anos atuando com desenvolvimento de software

• TDC 2011 – Trilha SOA• TDC 2012 – Trilhas iOS e Análise 2.0• TDC 2013 – Trilha Scala• TDC 2014 – Trilhas Arquitetura .NET, Big Data e HPC

Page 3: Apresentação Hadoop

Globalcode – Open4education

Big Data

Page 4: Apresentação Hadoop

Globalcode – Open4education

Big Data

O que é?

Page 5: Apresentação Hadoop

Globalcode – Open4education

Big Data

Big Data é

… como sexo na adolescência:

•todos falam sobre isso;•nenhum deles sabe realmente como fazer;•todos pensam que os amigos estão fazendo;•Todos dizem que estão fazendo;

Page 6: Apresentação Hadoop

Globalcode – Open4education

Big Data

Page 7: Apresentação Hadoop

Globalcode – Open4education

Big Data

• Nos últimos dois anos, criamos 90% de todos os dados disponíveis no mundo.

• Atualmente, geramos algo próximo a 15 petabytes em somente um dia, o que equivale à soma de cada palavra dita desde o início dos tempos.

Fonte : A nova era da computação Freddy Vaquero – VP de Sistemas e Tecnologia IBM Brasil

http://www.ibm.com/midmarket/br/pt/articles_nova_era_computacao.html

Page 8: Apresentação Hadoop

Globalcode – Open4education

Big Data

Como começar?

Page 9: Apresentação Hadoop

Globalcode – Open4education

Big Data

Page 10: Apresentação Hadoop

Globalcode – Open4education

Big Data

Principais Implementadores Hadoop

•Cloudera

•MapR

•Hortonworks

Page 11: Apresentação Hadoop

Globalcode – Open4education

Big Data

Distribuições Cloudera

Cloudera Enterprise

Designed specifically for mission-critical environments, Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. Cloudera is your partner on the path to big data.

•Fonte: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise.html

Page 12: Apresentação Hadoop

Globalcode – Open4education

Big Data

Cloudera ExpressThe Best Way to Get Started with Hadoop

Cloudera Express is a free download that combines CDH, Cloudera’s 100% open source and enterprise-ready distribution of Apache Hadoop with Cloudera Manager, which provides robust cluster management capabilities like automated deployment, centralized administration, monitoring, and diagnostic tools.

•Fonte: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-express.html

Page 13: Apresentação Hadoop

Globalcode – Open4education

Big Data

Cloudera ManagerEnd-to-End Administration for Hadoop

Cloudera Manager is the industry’s first and most sophisticated management application for Apache Hadoop and the enterprise data hub. Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of the data hub — empowering operators to improve performance, enhance quality of service, increase compliance and reduce administrative costs.

•Fonte: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager.html

Page 14: Apresentação Hadoop

Globalcode – Open4education

Big Data

Cloudera ManagerEnd-to-End Administration for Hadoop

Cloudera Manager is the industry’s first and most sophisticated management application for Apache Hadoop and the enterprise data hub. Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of the data hub — empowering operators to improve performance, enhance quality of service, increase compliance and reduce administrative costs.

•Fonte: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager.html

Page 15: Apresentação Hadoop

Globalcode – Open4education

Big Data

CDH100% Open Source Distribution including Apache Hadoop

CDH is the world’s most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.

•Fonte: http://www.cloudera.com/content/cloudera/en/products-and-services/cdh.html

Page 16: Apresentação Hadoop

Globalcode – Open4education

Big Data

Distribuições MapR

M3 Standard Edition

MapR M3 is a complete distribution for Apache TM Hadoop ® MapR M3 Standard Edition is a free and complete distribution for Apache™ Hadoop® that includes Apache HBaseTM, Apache Pig, Apache Hive, Apache Mahout, Cascading, Apache Sqoop, Apache Flume, and more. MapR M3 provides the capabilities for entry-level Hadoop users to develop Big Data applications using the complete Apache Hadoop stack while providing easy management, seamless interoperability and high performance.

Fonte: http://www.mapr.com/sites/default/files/mapr020_datasheet_m3.pdf

Page 17: Apresentação Hadoop

Globalcode – Open4education

Big Data

M5 Enterprise Edition

MapR M5 is a complete enterprise-grade distribution for ApacheTM Hadoop ® MapR M5 includes Apache Hbase TM, Apache Pig, Apache Hive, Apache Mahout,Cascading, Apache Sqoop, Apache Flume and more. MapR M5 not only provides advanced high availability (HA) and data protection features such as Self Healing HA, JobTracker HA, Snapshots and Mirroring, but also enables un-precedented Hadoop access and management capabilities through industry standard interfaces such as NFS and ODBC. MapR M5, avail able on a subscription basis, is fully supported for the most demanding mission-critical deployments

Fonte: http://www.mapr.com/sites/default/files/mapr020_datasheet_m5.pdf

Page 18: Apresentação Hadoop

Globalcode – Open4education

Big Data

M7 Enterprise DataBase Edition

MapR M7 offers all the powerful features of MapR M5 Enterprise Edition, and also includes Apache projects such as Apache HBase, Apache Pig, Apache Hive TM, Apache Mahout TM, Cascading, Apache Sqoop TM, Apache Flume TM, and more.

Fonte: http://www.mapr.com/sites/default/files/mapr_datasheet_m7.pdf

Page 19: Apresentação Hadoop

Globalcode – Open4education

Big Data

MapR Sandbox

The MapR Sandbox for Hadoop provides tutorials, demo applications, and browser-based user interfaces to let developers and administrators get started quickly with Hadoop. It is a fully functional Hadoop cluster running in a virtual machine.You can try our Sandbox now - it is completely free and available as a VMware or VirtualBox VM.

Fonte: http://www.mapr.com/products/mapr-sandbox-hadoop

Page 20: Apresentação Hadoop

Globalcode – Open4education

Big Data

Hortonworks Data Platform

Architected, developed and built completely in the open, Hortonworks Data Platform (HDP) is designed to meet the changing needs of enterprise data processing.

HDP is fundamentally versatile, providing linear, scalablestorage and compute across a wide range of accessmethods, from batch and interactive to real time, search and streaming. It includes a comprehensive set of theessential data capabilities required by the modern enterprise across governance, integration, security and operations.

Fonte: http://br.hortonworks.com/

Page 21: Apresentação Hadoop

Globalcode – Open4education

Big Data

Hortonworks Data SandBox

The easiest way to get started with Enterprise HadoopSandbox is a personal, portable Hadoop environment that comes with a dozen interactive Hadoop tutorials. Sandbox includes many of the most exciting developments from the latest HDP distribution, packaged up in a virtual environment that you can get up and running in 15 minutes!

Fonte: http://br.hortonworks.com/products/hortonworks-sandbox/

Page 22: Apresentação Hadoop

Globalcode – Open4education

Big Data

Principais Players Big Data

•IBM InfoSphere Cloudera

•Oracle Oracle Big Data Cloudera

•EMC GreenPlun MapR

•Teradata Hortonworks

•Microsoft HDInsight Hortonworks

Page 23: Apresentação Hadoop

Globalcode – Open4education

Big Data

FIM

Page 24: Apresentação Hadoop

Globalcode – Open4education

Big Data

Links:

•http://br.hortonworks.com/•https://www.ibm.com/developerworks/data/library/techarticle/dm-1210bigdatasecurity/•http://www.teradata.com/business-needs/Big-Data-Analytics/?ICID=Sbda#tabbable=0&tab1=0&tab2=0&tab3=0&tab4=0•http://www.mapr.com/products/mapr-sandbox-hadoop•http://www.oracle.com/us/products/database/big-data-appliance/overview/index.html•http://www.cloudera.com/content/cloudera/en/home.html