DATA SCIENCE AND DATA VISUALIZATION
BY:-
GROUP 3
Overview of Data Science
• Data science is, in general terms, the extraction of knowledge from collections of data.
• Its emphasis is on “statistical methods at large for collecting, analyzing, modelling” data and its applications.
Data science is therefore used in applications like:-
• Statistical Learning
• Data Processing
• Development and Management of Databases
• Data Warehousing
• Data Mining
DATA MINING
• Data mining software is one of a number of analytical tools for analyzing data.
• It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified.
• It enables companies to determine relationships among "internal" factors and "external" factors.
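As a rough illustration of analyzing the same data from different dimensions and summarizing it, the sketch below totals units sold along each of three dimensions. The records, field names (region, price_band, season) and the summarize helper are all hypothetical and chosen only for this example; price_band stands in for an "internal" factor and season for an "external" one.

```python
from collections import defaultdict

# Hypothetical sales records; every field name and value is made up for illustration.
records = [
    {"region": "North", "price_band": "low",  "season": "summer", "units": 120},
    {"region": "North", "price_band": "high", "season": "winter", "units": 40},
    {"region": "South", "price_band": "low",  "season": "winter", "units": 90},
    {"region": "South", "price_band": "high", "season": "summer", "units": 60},
]

def summarize(rows, dimension):
    """Total the units sold for each distinct value of one dimension."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[dimension]] += row["units"]
    return dict(totals)

# The same records viewed from three different angles.
by_region = summarize(records, "region")
by_price_band = summarize(records, "price_band")  # an "internal" factor
by_season = summarize(records, "season")          # an "external" factor

for name, summary in [("region", by_region),
                      ("price_band", by_price_band),
                      ("season", by_season)]:
    print(name, summary)
```

Real data mining software works on far larger data and many more dimensions; the point here is only that one data set yields different summaries depending on the dimension chosen.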
Data warehousing
• Data warehousing can be said to be the process of centralizing or aggregating data from multiple sources into one common repository.
• It basically keeps the data that are useful in the decision support process.
• The process of data mining consists of three stages:
• (1) Initial Exploration
• (2) Pattern Identification
• (3) Deployment
Data Mining Process
STATISTICAL LEARNING
• Statistical learning refers to a set of tools for modeling and understanding complex datasets.
• It blends with parallel developments in computer science and in particular machine learning.
• Pattern recognition is one aspect of artificial intelligence.
• Through it, one learns to distinguish patterns of interest.
• One also learns to make reasonable decisions about the categories of the patterns.
• Regression example: plot of 10 sample points for the input variable x along with the corresponding target variable t.
• Green curve is the true function that generated the data.
Polynomial curve fitting: plots of polynomials having various orders, shown as red curves, fitted to the set of 10 sample points.
Polynomial curve fitting: plots of 9th order polynomials fitted to 15 and 100 sample points.
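The curve-fitting examples above can be reproduced approximately with the sketch below, which uses NumPy's polyfit and polyval. The slides do not name the true function or the noise level, so sin(2πx), a noise standard deviation of 0.3, evenly spaced inputs, and the orders 0, 1, 3 and 9 are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable

def true_function(x):
    # Assumption: the slides do not name the generating function (the "green curve").
    return np.sin(2 * np.pi * x)

def make_samples(n, noise=0.3):
    """Return n evenly spaced inputs x and noisy targets t (noise level is assumed)."""
    x = np.linspace(0.0, 1.0, n)
    t = true_function(x) + rng.normal(scale=noise, size=n)
    return x, t

def rms(pred, target):
    """Root-mean-square difference between two arrays."""
    return float(np.sqrt(np.mean((pred - target) ** 2)))

# Polynomials of various orders fitted to 10 sample points.
x, t = make_samples(10)
train_rms = {}
for order in (0, 1, 3, 9):
    coeffs = np.polyfit(x, t, deg=order)  # least-squares fit
    train_rms[order] = rms(np.polyval(coeffs, x), t)
    print(f"order {order}: training RMS error = {train_rms[order]:.4f}")

# 9th order polynomials fitted to 15 and 100 sample points,
# compared against the true function on a dense grid.
grid = np.linspace(0.0, 1.0, 200)
for n in (15, 100):
    xn, tn = make_samples(n)
    coeffs = np.polyfit(xn, tn, deg=9)
    err = rms(np.polyval(coeffs, grid), true_function(grid))
    print(f"9th order, {n} points: RMS distance from true function = {err:.4f}")
```

A 9th order polynomial has ten coefficients, so it can pass through all 10 sample points, which is why its training error is close to zero even though the curve itself may swing far from the true function between the points.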
DATA VISUALIZATION
BAR CHART
PIE CHART
HORIZONTAL BARS
SIDE-BY-SIDE CHART
TREND LINE
TEXT TABLES
CIRCLES VIEW
AREA CHARTS
SCATTER PLOTS
PACKED BUBBLES
Why To Use Data Visualizations?
• ANALYSING
• SUPPORT THE STORY
• TELL STORY
Tools help us to See
TRENDS
CORRELATIONS
PATTERNS
IN DECISION MAKING
GOOGLE FUSION TABLES
Big Data (Data Visualization’s Best friend)
• Big Data is an ocean of structured and unstructured data which is too large and complex to process with traditional data-processing tools.
• This data is used to reveal patterns, trends and associations.
TREEMAPS
SCATTER PLOT
GANTT CHART
STREAM GRAPH
DATA VISUALIZATION
Application of Big Data Visualization
Healthcare
E-commerce
Government
Healthcare
http://www.tableausoftware.com/solutions/healthcare-analytics
E-Commerce
E-commerce companies use Big Data in two different ways:
• Real-time Analysis
• Past Behavior of Customers
Government
http://www.transparency.org/gcb2013/results
Data Science In Neuroscience
Use Of Data Science:
Medical Informatics
Biological Sciences
Health Care
Social Sciences
Humanities
Neuroinformatics
• Data science in neuroscience: Neuroinformatics.
• What is neuroinformatics?
Thunder tool
• Developed at the Howard Hughes Medical Institute’s Janelia research campus.
• Built on the Apache Spark platform.
• Open-source software.
• Runs on Amazon's cloud computing services.
• Distributed computing.
• Speeds the analysis of large data sets.
• Analyzes highly detailed images of the brain.
THANK YOU!!!