3

Learning Spark

Embed Size (px)

DESCRIPTION

spark

Citation preview

1. introduction

TableofContents

LearningSpark

2

TranslationthebookofLearningSpark:Lightning-FastBigDataAnalysisisonlyforsparkdevelopereducationalpurposes.IfIviolatedyourcopyright,pleaseletmeknow.

《LearningSpark:Lightning-FastBigDataAnalysis》的中文翻译纯属个人对于Spark的兴趣,仅供学习。

如果我的翻译行为侵犯您的版权,请您告知,我将停止对此书的开源翻译。

HoldenKarauisasoftwaredevelopmentengineeratDatabricksandisactiveinopensource.SheistheauthorofanearlierSparkbook.PriortoDatabrickssheworkedonavarietyofsearchandclassificationproblemsatGoogle,Foursquare,andAmazon.ShegraduatedfromtheUniversityofWaterloowithaBachelorsofMathematicsinComputerScience.Outsideofsoftwaresheenjoysplayingwithfire,welding,andhulahooping.

Mostrecently,AndyKonwinskico-foundedDatabricks.BeforethathewasaPhDstudentandthenpostdocintheAMPLabatUCBerkeley,focusedonlargescaledistributedcomputingandclusterscheduling.Heco-createdandisacommitterontheApacheMesosproject.HealsoworkedwithsystemsengineersandresearchersatGoogleonthedesignofOmega,theirnextgenerationclusterschedulingsystem.Morerecently,hedevelopedandledtheAMPCampBigDataBootcampsandfirstSparkSummit,andhasbeencontributingtotheSparkproject.

PatrickWendellisanengineeratDatabricksaswellasaSparkCommitterandPMCmember.IntheSparkproject,PatrickhasactedasreleasemanagerforseveralSparkreleases,includingSpark1.0.PatrickalsomaintainsseveralsubsystemsofSpark'scoreengine.BeforehelpingstartDatabricks,PatrickobtainedanM.S.inComputerScienceatUCBerkeley.Hisresearchfocusedonlowlatencyschedulingforlargescaleanalyticsworkloads.HeholdsaB.S.EinComputerSciencefromPrincetonUniversity

MateiZahariaisthecreatorofApacheSparkandCTOatDatabricks.HeholdsaPhDfromUCBerkeley,wherehestartedSparkasaresearchproject.HenowservesasitsVicePresidentatApache.ApartfromSpark,hehasmaderesearchandopensourcecontributionstootherprojectsintheclustercomputingarea,includingApacheHadoop(whereheisacommitter)andApacheMesos(whichhealsohelpedstartatBerkeley).

codeshttps://github.com/gaoxuesong/learning-spark/forkedfromhttps://github.com/databricks/learning-spark

LearningSpark:Lightning-FastBigDataAnalysisChinesetranslation

AbouttheAuthor

ExamplesforLearningSpark

LearningSpark

3introduction