Facts of Data in Foundation of Data science

Define Facts of Data

        Data Science is focused on fact of Data in Foundation  of  Data science, It can be Processing of complex datasets, of the integrated dataset. Building predictive models from those data. The dataset's are classified into  

  •    Upstream process
  •    Downstream process           

Upstream process

Facts of Data in Foundation of Data science in Upstream process


Downstream process

Facts of Data in Foundation of Data science in Downstream process

There are many facts of data science

  1. Identifying the structure of data Cleaning, Filtering, Re-organising, Augmenting and aggregating data.
  2. It can be Visualizing data.
  3. Data analysis ,Statistic's and modelling , Machine learning.
  4. Assembling data processing pipelines to link these steps
  5. Leveraging high end computational resources for large scale problems.

Main categories of Data

  • Structured
  • Unstructured
  • Natural Language
  • Machine - generated
  • Graph-based
  • Audio ,Video and Images
  • Streaming

Structured Data

    Hierarchical data such as family tree is also called Structured data as they are stored in a particular structure.

Example


Facts of Data in Foundation of Data science in Structured Data


Unstructured Data

    Unstructured data is data that is not easy to fit into a data model because the content is content-specific or varying

Facts of Data in Foundation of Data science in UnStructured Data

Natural Language

    It is a special type of unstructured data , It is challenge to process because it required knowledge of specific data science techniques and linguistics

Example

                Alexa, Apple Siri, Chat GPT


Machine generated Data

    Machine generated data is the information that's automatically created by a computer , process, application or other machine without Human intervention.

Example

                Random number generator, OTP generator etc.

Graph based Data

       Graph is a mathematical structure to model pair wise relationship between objects. Graph focus on relationship or adjacency of objects.


Facts of Data in Foundation of Data science in Graph based Data

Audio, Image and Video Data

        Video, image and audio are data type that pose specific challenge to a data scientist. Task such as recognizing objects in pictures are more challenging for computers


Facts of Data in Foundation of Data science in Audio, Video and Image

Streaming Data

        Data streaming is data that continuously flows from a source to a destination to be processed and analyzed in near real time.

Facts of Data in Foundation of Data science in Streaming Data


Conclusion

        Facts of Data in Foundation of Data Science contains the highlights and answers to the problem statement objectives and hypothesis




0 Comments

Post a Comment

Post a Comment (0)

Previous Post Next Post