top of page
Writer's pictureAnsiya Nasar

EXPLORING DATA SCIENCE




WHAT IS DATA SCIENCE?

  • Data science is a simple process that extracts useful information from data.

  • 2.5 quintillion bytes of data are produced every day. It will process and analyze the entire data and extract what is needed for business growth.




WHICH CODING LANGUAGES AND SKILL SETS ARE REQUIRED FOR LEARNING DATA SCIENCE?


  1. Python - It's a popular language, which is used in data science for data analysis, machine learning, and visualization.


  2. SQL (Structured Query Language) - It is a programming language, which is used to create and make changes as users need in a data set. We can insert, delete, and update data sets.


  3. Statistics - By using various statistical methods, we will be able to get the meaning and information from the data easily.


  4. Data extraction and Processing – We get data from different sources, and here we make the data in a format so that we can analyze it easily.


  5. Data Cleaning - Data cleaning is the process of improving the quality of the data by removing the null values, inconsistent values, etc.


  6. Machine Learning - It is the process of designing algorithms, that can learn from data automatically, identify patterns, and make decisions.


  7. Data visualization - It is the process of creating graphical representations of data to understand, analyze, and communicate the information and patterns.



THE FOLLOWING ARE THE STEPS IN THE DATA SCIENCE PROCESS:


  1. Setting the Research Goal

  2. Retrieving Data

  3. Data Preparation

  4. Data Exploration

  5. Data Modeling

  6. Presentation








Comments


bottom of page