ETL-PROJECT
PRESENTED BY:- MONIKA VERMA, ANNUSHRI SHARMA, RATI LODHA
PRESENTED TO:-SHRADHHA MASIH
WEBLOG DATA
OUTLINES
Introduction
Process
About the project
Snapshots
Input data
Performing validation
Performing transformation
Writer
Output data
ETL:- INTRODUCTION
ETL is short for extract, transform, load: three database functions combined into one tool that pulls data out of one database and places it into another.
WHAT IS ETL?
ETL = Extract – Transform – Load
Extract › Get the data from the source system as efficiently as possible
Transform › Perform calculations on the data
Load › Load the data into the target storage
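The three steps can be sketched in a few lines of Python. This is only an illustration, not part of the project: the tables `orders_raw` and `orders_clean`, their columns, and the function names are hypothetical, and two in-memory SQLite databases stand in for the source and target systems.

```python
import sqlite3


def extract(source):
    """Extract: read every row from the hypothetical source table."""
    return source.execute(
        "SELECT item, quantity, unit_price FROM orders_raw"
    ).fetchall()


def transform(rows):
    """Transform: tidy the item name and calculate a total per row."""
    return [
        (item.strip().title(), quantity * unit_price)
        for item, quantity, unit_price in rows
    ]


def load(target, rows):
    """Load: store the transformed rows in the hypothetical target table."""
    target.execute(
        "CREATE TABLE IF NOT EXISTS orders_clean (item TEXT, total INTEGER)"
    )
    target.executemany("INSERT INTO orders_clean VALUES (?, ?)", rows)
    target.commit()


def run_etl(source, target):
    """Run the three phases in order: extract, then transform, then load."""
    load(target, transform(extract(source)))


def demo():
    """Build a tiny source database, run the ETL, and return the target rows."""
    source = sqlite3.connect(":memory:")
    target = sqlite3.connect(":memory:")
    source.execute(
        "CREATE TABLE orders_raw (item TEXT, quantity INTEGER, unit_price INTEGER)"
    )
    source.executemany(
        "INSERT INTO orders_raw VALUES (?, ?, ?)",
        [("  pen ", 3, 10), ("BOOK", 2, 50)],
    )
    run_etl(source, target)
    return target.execute(
        "SELECT item, total FROM orders_clean ORDER BY item"
    ).fetchall()
```

Keeping each phase in its own function means the source, the calculations, or the target can be swapped without touching the other two.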
PROCESS
ABOUT THE PROJECT:-
The project is based on WEB-LOG DATA.
LOG FILE - A log file is a file that records either events that occur while an operating system or other software runs, or messages between different users of communication software.
Logging is the act of keeping a log. In the simplest case, messages are written to a single log file.
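As a small illustration of the simplest case, the sketch below writes messages to a single log file with Python's standard `logging` module. The logger name `demo_app`, the file name `app.log`, and the sample messages are made up for the example and are not taken from the project.

```python
import logging
import os
import tempfile


def write_log(path, messages):
    """Append each message to one log file, with a timestamp and level."""
    logger = logging.getLogger("demo_app")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep the messages out of other handlers
    handler = logging.FileHandler(path, encoding="utf-8")
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    )
    logger.addHandler(handler)
    try:
        for message in messages:
            logger.info(message)
    finally:
        # Detach and close the handler so the file is flushed and released.
        logger.removeHandler(handler)
        handler.close()


def read_log(path):
    """Return the log file's contents as a list of lines."""
    with open(path, encoding="utf-8") as handle:
        return handle.read().splitlines()


def demo():
    """Write two sample events to a temporary log file and read them back."""
    path = os.path.join(tempfile.mkdtemp(), "app.log")
    write_log(path, ["user logged in", "file saved"])
    return read_log(path)
```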
We took up this project on our own initiative to learn the ETL process. It cleans a system's log file of Internet usage with the ETL tool provided to us.
We went through all the phases of the Advanced ETL Processor tool and generated cleaned and formatted data in a spreadsheet as our output.
Along the way, we applied multiple inbuilt functions to our weblog data.
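The project did this work in the ETL tool itself. Purely as an illustration, the same validate, transform, and write sequence might look like the Python sketch below. It assumes the input is in the Common Log Format, since the actual layout of our log file is not reproduced here, and all function and field names are hypothetical. The output is CSV, which opens in any spreadsheet program.

```python
import csv
import io
import re
from datetime import datetime

# Assumed input layout: Common Log Format, e.g.
# 127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "GET /a.gif HTTP/1.0" 200 2326
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)

FIELDS = ["host", "timestamp", "method", "path", "status", "bytes"]


def validate(line):
    """Validation: return the parsed fields, or None for a malformed line."""
    match = LOG_PATTERN.match(line.strip())
    return match.groupdict() if match else None


def transform(record):
    """Transformation: normalise the timestamp and convert numeric fields."""
    timestamp = datetime.strptime(record["time"], "%d/%b/%Y:%H:%M:%S %z")
    return {
        "host": record["host"],
        "timestamp": timestamp.isoformat(),
        "method": record["method"],
        "path": record["path"],
        "status": int(record["status"]),
        "bytes": 0 if record["size"] == "-" else int(record["size"]),
    }


def write_csv(records, out):
    """Writer: emit the cleaned records as CSV with a header row."""
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)


def clean_log(lines, out):
    """Run all phases; return (number of rows written, number rejected)."""
    cleaned, rejected = [], 0
    for line in lines:
        record = validate(line)
        if record is None:
            rejected += 1
            continue
        cleaned.append(transform(record))
    write_csv(cleaned, out)
    return len(cleaned), rejected


def demo():
    """Clean one valid and one malformed sample line; return the CSV text."""
    sample = [
        '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
        '"GET /apache_pb.gif HTTP/1.0" 200 2326',
        "this line is not a log entry",
    ]
    out = io.StringIO()
    clean_log(sample, out)
    return out.getvalue()
```

Rejected lines are counted rather than silently dropped, so the number of input lines can be reconciled against the number of output rows.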
Input log file
Starting the project: the different ETL phases:-
Performing Validation :
Performing transformation :
Write the data through writer
Writer
OUTPUT OF CLEAN DATA
Thanks!!!!!