IPLUSO 23558
Statistics for Data Science
Computer Applications for Data Science
-
ApresentaçãoPresentationThis course introduces the fundamental concepts of statistics for data science, including probability distributions, sampling techniques, statistical inference, exploratory data analysis and curve fitting. In addition, the course introduces statistical modeling techniques, including linear regression and analysis of variance, as well as multivariate analysis techniques, such as principal component analysis and clustering.
-
ProgramaProgramme1. introduction to statistics and probability 2. Sampling techniques 3. Statistical inference 4. Exploratory data analysis 5. Statistical decision theory, hypothesis tests and significance tests. 6. Theory of small samples. Student's t distribution and Chi square distribution. 7. Curve fitting and the least squares method 8. Statistical modeling 9. Multivariate analysis
-
ObjectivosObjectives1. Understand the fundamental concepts of probability and probability distributions; 2. Apply sampling techniques to collect and analyze data; 3. Perform statistical inference, including hypothesis testing and confidence intervals; 4. Perform exploratory data analysis using graphs and descriptive statistical measures; 5. Apply statistical modeling techniques, including linear regression and analysis of variance; 6. Apply multivariate analysis techniques, such as principal component analysis and clustering.
-
BibliografiaBibliographyJames, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: with applications in R. Springer. Steele, B., Chandler, J., & Reddy, S. (2018). Statistics for data science: A comprehensive introduction. O'Reilly Media, Inc. Gelman, A., & Hill, J. (2006). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press.
-
MetodologiaMethodologyThe teaching methodology will be based on lectures, practical examples and exercises, as well as the use of data analysis software (e.g. R commander or Jamovi). Students will also be encouraged to develop practical projects involving the analysis of real data, with a focus on interpreting and communicating the results. Assessment will be based on individual and group work, as well as two tests. Students will also be assessed on their ability to apply statistical concepts to real problems and their ability to communicate data analysis results clearly and concisely with appropriate technical terminology.
-
LínguaLanguagePortuguês
-
TipoTypeSemestral
-
ECTS6
-
NaturezaNatureMandatory
-
EstágioInternshipNão




