Data Science

matplotlibimage


This image shows a result from the Matplotlib Phyton Library. It shows the humidity (in %), temperature (in °C), pressure (in mb) and rain (in mm) for the month July 2021 in the city of Spa measured every 3 hours. The darkgrey bols mark 12am night en the orange bols mark 12pm noon.
Data from the worldweatheronline Api Click here to see data file



This video shows a cross country analysis of features that correlate with life expectancy. I uploaded the data from UNdata and NationMaster to Power Bi. The visualisation is made with the Line Chart, Pie Chart, Cards and Correlation Plot Visual


Data Science is a combination of statistics and computing.
Statistical methods allow to gain precise knowledge and insight from data. Nowadays statistical analysis is typically done with computers (faster, less mistakes, can deal with big Data). Discovering patterns in Big Data is called data mining. Data mining can be for hypothesis validation, knowledge discovery, descriptive (focus on correlations) or predictive (focus on predictions). Data can be acquired from APIs, web scraping, experiments (2 groups, changing circumstances) or observations.
Data scientists need computer knowledge, namely Database, Machine Learning and the languages Python (used by 57% of the scientists) or R + libraries. Examples of Python libraries are NumPy for array use, Pandas for analysing data, SciPy for scientific computing and Matplotlib for graphic plotting.
Artificial Intelligence means simulation of human intelligence (perception, learning, communication, problem solving,...). AI can be (AI robots) but is not always (industrial robots) built into robots. Machine Learning is a step to AI. It means programming computer to use statistics and learn from data.