36
DATA SCIENCE AND DATA VISUALIZATION BY:- GROUP 3

Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Embed Size (px)

Citation preview

Page 1: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

DATA SCIENCE AND DATA VISUALIZATION

BY:-

GROUP 3

Page 2: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 3: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Overview of Data Science

• Data science is, in general terms, the extraction of knowledge from collection of data.

• Its emphasis is on “statistical methods at large for collecting, analyzing, modelling” data and its applications.

Page 4: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Data science is therefore used in applications like:-

Statistical Learning

Data Processing,

Development And

Management Of Databases

Data Warehousing Data Mining

Page 5: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 6: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

DATA MINING

• Data mining software is one of a number of analytical tools for analyzing data.

• It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified.

• It enables these companies to determine relationships among "internal" factors and "external" factors.

Page 7: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Data warehousing

• Data warehousing can be said to be the process of centralizing or aggregating data from multiple sources into one common repository.

• It is basically excluding data that are useful in decision support process

Page 8: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

• The process of data

mining consists of three stages:

• (1) The Initial Exploration, • (2) Pattern Identification • (3) Deployment

Page 9: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Data Mining Process

Page 10: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

STATISTICAL LEARNING

Page 11: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

• Statistical learning refers to a set of tools for modeling and understanding complex datasets.

• It blends with parallel developments in computer science and in particular machine learning.

Page 12: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

• Pattern recognition is one aspect of artificial intelligence.

• One learn to distinguish patterns of interest,• To make reasonable decisions about the

categories of the patterns

Page 13: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

• Regression example: plot of 10 sample points for the input variable x along with the corresponding target variable t.

• Green curve is the true function that generated the data.

Page 14: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Polynomial curve fitting: plots of polynomials having variousorders, shown as red curves, fitted to the set of 10 sample points

Page 15: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Polynomial curve fitting: plots of 9’th order polynomials fitted to 15 and 100sample points.

Page 16: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

DatDATA VISUALIZATION

Page 17: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

BAR CHART

PIE CHART

HORIZONTAL BARS

SIDE-BY-SIDE CHART

TREND LINE TEXT

TABLES

CIRCLES VIEW

AREA CHARTS

SCATTER PLOTS

PACKED BUBBLES

Page 18: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Why To Use Data Visualizations?

• ANALYSING• SUPPORT THE STORY• TELL STORY

Page 19: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Tools help us to See

TRENDS

CORRELATIONSPATTERNS

Page 20: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

IN DECISION MAKING

Page 21: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

GOOGLE FUSION TABLES

Page 22: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 23: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 24: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Big Data (Data Visualization’s Best friend)

• Big Data is an ocean of structured and unstructured data which is too and large and complex to process.

• This data is used to reveal patterns, trends and associations.

Page 25: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 26: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera
Page 27: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

TREEMAPS

SCATTER PLOT

GANTT CHART

STEAM GRAPH

DATA VISUALIZATION

Page 28: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Application of Big Data Visualization

Healthcare

E-commerce

Government

Page 29: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Healthcare

http://www.tableausoftware.com/solutions/healthcare-analytics

Page 30: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

E-Commerce

E-commerce companies use Big data in two different ways:•Real-time Analysis•Past Behavior of Customer

Page 32: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Data Science In Neuroscience

Page 33: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Use Of Data Science:

Medical Informatics

Biological Sciences

Health Care Social Sciences

Humanities

Page 34: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Neuroinformatics

• Data science in neuroscience- Neuroinformatics.• What is neuroinformatics?

Page 35: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

Thunder tool• Developed at the Howard Hughes Medical Institute’s Janelia

research campus.• Built on Apache Spark Platform.• Open-source software .• Runs on Amazon's cloud computing services. • Distributed computing.• Speeds the analysis of large data sets .• Analyze highly-detailed images of the brains.

Page 36: Data Science and Data Visualization (All about Data Analysis) by Pooja Ajmera

THANK YOU!!!