“Computational thinking involves solving problems, designing systems, and understanding human behavior, by drawing on the concepts fundamental to computer science. Computational thinking includes a range of mental tools that reflect the breadth of the field of computer science.” Jeannette Wing, “Computational Thinking”
DATA!! You think you have it, but do you really? When you just step into the data world, you must learn the different concepts and terms that relate to data. What is “data”, exactly, to a data scientist?
To process and analyze data, we use software. It is important to understand the difference between algorithms and software, the importance of open source software, and the principles and mechanics of how non-programmers use software.
A “workflow” usually describes how activities are organized to achieve a common goal. What is a computational workflow, and how does it help non-programmers analyze data?
Introduction, Classification. There are many different kinds of data analytics tasks. Are you looking to characterize objects, or just find repeating patterns on your data? Classification is a common data analytics task, which involves assigning a category (i.e., a class) for a new instance.
Pattern Discovery, Clustering, Simulation. Discovering patterns in data, clustering objects, and simulating scenarios for prediction are all data analytics tasks.
Causality Causal models of data can help understand complex phenomena beyond simple correlation. Causal discovery is one of the most advanced and challenging data analytics tasks.
The majority of the time in a data science project is spent preparing the data for processing, by cleaning, reformatting, and reorganizing data. There are many possible steps that you can take to get data ready for processing.