Easy, Fast Machine Learning with Spark and MLlib

Hadoop Conference Japan 2014 LT 2014/7/8

  • 1. Spark MLlib R&D @yamakatu Hadoop Conference Japan 2014 2014/7/8

  • 2. Who are U!!! @yamakatu R&D R&D Gihyo.jp Mahout IPA
  • 3. Spark MLlib
  • 4. Hadoop
  • 5. Hadoop
  • 6. Spark
  • 7. or Spot Instance + Spark + MLlib
  • 8. Ref. Spark: A framework for iterative and interactive cluster computing http://laser.inf.ethz.ch/2013/material/joseph/LASER-Joseph-6.pdf
  • 11. What's
  • 12. Spark
  • 17. Spark
  • 18. MLlib 1.0: SVM, ridge, Lasso, GLM(), K-Means, ALS, Stochastic Gradient Descent, Naive Bayes, DecisionForest, RandomForest, SVD, PCA, L-BFGS
  • 19. Spark 1.1
  • 21. Java
        JavaSparkContext sc = new JavaSparkContext(new SparkConf().setAppName("JavaLR"));
        JavaRDD<LabeledPoint> points = sc.textFile(args[0]).map(new ParsePoint()).cache();
        LogisticRegressionModel model = LogisticRegressionWithSGD.train(
            points.rdd(), Integer.parseInt(args[2]), Double.parseDouble(args[1]));
        sc.stop();
        4+
  • 23. Hadoop 100
  • 25. Ref. Spark: A framework for iterative and interactive cluster computing http://laser.inf.ethz.ch/2013/material/joseph/LASER-Joseph-6.pdf
  • 28. Hadoop by
  • 29. Hadoop Spark
  • 34. Hadoop Spark Spark Spark MLlib MLlib Spark
  • 35. Have a nice Machine Learning!!
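
The Java snippet on slide 21 hands all of the training work to the MLlib train call, so the iterative nature of the algorithm is hidden. As a rough illustration only, the sketch below shows logistic regression trained by plain full-batch gradient descent in single-machine Java with no Spark dependency. It is not MLlib's implementation (MLlib runs a distributed, stochastic variant), and the class name LogisticRegressionSketch, its methods, and the toy data are all hypothetical. The parameter order (data, number of iterations, step size) is chosen to mirror the call on the slide.

```java
import java.util.Arrays;

// Plain-Java sketch of logistic regression trained by full-batch gradient
// descent. Hypothetical names throughout; this is NOT Spark MLlib code.
class LogisticRegressionSketch {

    static double sigmoid(double z) {
        return 1.0 / (1.0 + Math.exp(-z));
    }

    static double dot(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Returns the learned weights, starting from an all-zero vector.
    static double[] train(double[][] features, double[] labels,
                          int numIterations, double stepSize) {
        int n = features.length;
        int d = features[0].length;
        double[] weights = new double[d];
        for (int iter = 0; iter < numIterations; iter++) {
            double[] gradient = new double[d];
            // Each iteration rescans the whole data set. This repeated scan is
            // the access pattern that keeping the data in memory speeds up.
            for (int i = 0; i < n; i++) {
                double error = sigmoid(dot(weights, features[i])) - labels[i];
                for (int j = 0; j < d; j++) {
                    gradient[j] += error * features[i][j];
                }
            }
            // Step against the averaged gradient of the log loss.
            for (int j = 0; j < d; j++) {
                weights[j] -= stepSize * gradient[j] / n;
            }
        }
        return weights;
    }

    // Classifies a point as 1.0 or 0.0 using a 0.5 probability threshold.
    static double predict(double[] weights, double[] x) {
        return sigmoid(dot(weights, x)) >= 0.5 ? 1.0 : 0.0;
    }

    public static void main(String[] args) {
        // Toy data (assumption): column 0 is a constant bias term,
        // and the label is 1 when the second column is positive.
        double[][] features = {{1, -2}, {1, -1}, {1, 1}, {1, 2}};
        double[] labels = {0, 0, 1, 1};
        double[] weights = train(features, labels, 100, 1.0);
        System.out.println("weights = " + Arrays.toString(weights));
        System.out.println("predict(x = 2) = " + predict(weights, new double[]{1, 2}));
    }
}
```

Because every pass of the outer loop reads the full data set again, an on-disk system pays the read cost on each iteration, while the cache() call on slide 21 lets Spark keep the parsed points in memory between passes.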