Advanced Practical Course Data Science (Winter 2019/2020): Difference between revisions

From NET Wiki
Jump to navigation Jump to search
 
(20 intermediate revisions by 3 users not shown)
Line 7: Line 7:
|module=M.Inf.1800 Fortgeschrittenen Praktikum Computernetzwerke
|module=M.Inf.1800 Fortgeschrittenen Praktikum Computernetzwerke
|lecturer=[http://134.76.18.81/?q=people/prof-dr-xiaoming-fu Prof. Xiaoming Fu]; [http://134.76.18.81/?q=people/dr-yali-yuan Dr. Yali Yuan]
|lecturer=[http://134.76.18.81/?q=people/prof-dr-xiaoming-fu Prof. Xiaoming Fu]; [http://134.76.18.81/?q=people/dr-yali-yuan Dr. Yali Yuan]
|ta=[http://www.net.informatik.uni-goettingen.de/people/jiaquan_zhang MSc. Jiaquan Zhang http://www.net.informatik.uni-goettingen.de/?q=people/shuai-xu Shuai Xu]
|ta=[http://www.net.informatik.uni-goettingen.de/people/jiaquan_zhang MSc. Jiaquan Zhang]; [http://www.net.informatik.uni-goettingen.de/?q=people/shuai-xu Shuai Xu]
|time=Friday, 16-18
|time=Friday, 16-18
|place=Ifi 2.101  
|place=Ifi 2.101  
Line 26: Line 26:


Students need to submit their solutions to tasks by specific deadlines throughout the course. Note that this course thus requires a continuous effort throughout the whole semester.
Students need to submit their solutions to tasks by specific deadlines throughout the course. Note that this course thus requires a continuous effort throughout the whole semester.
Solutions for each task have to be presented in class. A final report needs to be submitted at the end of the semester (September 30).
Solutions for each task have to be presented in class. A final report needs to be submitted at the end of the semester.


==Prerequisites==
==Prerequisites==
Line 32: Line 32:
*Knowledge of any of the following languages: Python (course language), R, JAVA, Matlab or any language that features proper machine learning libraries
*Knowledge of any of the following languages: Python (course language), R, JAVA, Matlab or any language that features proper machine learning libraries


----
==Schedule==
==Schedule==
{| {{Prettytable|width=}}
{| {{Prettytable|width=}}
Line 39: Line 41:
|-
|-
| align="right" | 25.10.2019
| align="right" | 25.10.2019
| Lecture 1: Introduction & The Data Science Pipeline
| Lecture 1: Introduction & The Data Science Pipeline - Task 1: Release
|-
|-
| align="right" | 01.11.2019
| align="right" | 01.11.2019
| Lecture 2: The Python Data Science Stack - Task 1: Release
| No lecture
|-
|-
| align="right" | 08.11.2019
| align="right" | 08.11.2019
| No lecture
| Lecture 2: The Python Data Science Stack
|-
|-
| align="right" | 15.11.2019
| align="right" | 15.11.2019
| Task 1: Intermediate meeting  
| Task 1: Intermediate meeting
|-
|-
| align="right" | 22.11.2019
| align="right" | 22.11.2019
| Lecture 3: Advanced Algorithms for Data Science // Task 2: release
| Lecture 3: Advanced Algorithms for Data Science // Task 1 report submission // Task 2: release
|-
|-
| align="right" | 29.11.2019
| align="right" | 29.11.2019
| No lecture // Task 1 report submission
| No lecture  
|-
|-
| align="right" | 06.12.2019
| align="right" | 06.12.2019
| No lecture // Task 3: release
| No lecture  
|-
|-
| align="right" | 13.12.2019
| align="right" | 13.12.2019
| No lecture // Task 2 report submission
| Lecture 4: Evaluation and Tuning of Models // Task 2 report submission // Task 3: release
|-
|-
| align="right" | 20.12.2019
| align="right" | 20.12.2019
| Lecture 4: Evaluation and Tuning of Models
| No lecture
|-
|-
| align="right" | 27.12.2019
| align="right" | 27.12.2019
| Task 3: Intermediate meeting I
| No lecture
|-
|-
| align="right" | 03.01.2020
| align="right" | 03.01.2020
Line 72: Line 74:
|-
|-
| align="right" | 10.01.2020
| align="right" | 10.01.2020
| Task 3: Intermediate meeting II
| Task 3: Intermediate meeting I
|-
|-
| align="right" | 17.01.2020
| align="right" | 17.01.2020
Line 81: Line 83:
|-
|-
| align="right" | 31.01.2020
| align="right" | 31.01.2020
| No lecture // Final slides submission (Only slides)
| Task 3: Intermediate meeting II
|-
|-
| align="right" | 07.02.2020
| align="right" | 31.01.2020  07.02.2020
| Task 3: Presentations (Final Presentation)
| Task 3: Presentations (Final Presentation)
|-
|-
| align="right" | 31.03.2020
| align="right" | 05.05.2020
| Final Report deadline (Including report and code)
| Final Report deadline (Including report and code)
|-
|-
|}
|}

Latest revision as of 15:54, 4 May 2020

Imbox content.png Note: The primary platform for communication in this course will be StudIP. All materials will be uploaded there.


Details

Workload/ECTS Credits: 180h, 6 ECTS
Module: M.Inf.1800 Fortgeschrittenen Praktikum Computernetzwerke
Lecturer: Prof. Xiaoming Fu; Dr. Yali Yuan
Teaching assistant: MSc. Jiaquan Zhang; Shuai Xu
Time: Friday, 16-18
Place: Ifi 2.101
UniVZ link


Course Organization

In this course, you will complete several practical tasks in the realm of data analysis. These tasks can include both exploratory (descriptive) data analysis as well as the application of machine learning algorithms to specific datasets.

While the focus of the course is strongly practical, to support students, the course will provide lectures on different aspects of practical machine learning in the early stages of the course, including:

  • Introduction to the practical machine learning pipeline
  • Exploratory data analysis
  • The Python Data Science stack
  • How to deal with unbalanced data
  • Advanced algorithms for Data Science (an overview of competition winning algorithms)
  • Parameter tuning for predictive models

Students need to submit their solutions to tasks by specific deadlines throughout the course. Note that this course thus requires a continuous effort throughout the whole semester. Solutions for each task have to be presented in class. A final report needs to be submitted at the end of the semester.

Prerequisites

  • You are highly recommended to have completed a course on Data Science (e.g., "Data Science and Big Data Analytics" taught by Dr. Steffen Herbold or the Course "Machine Learning" by Stanford University) before entering this course. You need to be familiar with basic statistics (distributions, p/t/z-tests, etc.) and a range of machine learning algorithms (linear/logistic/lasso regression, k-means clustering, k-NN classification etc.).
  • Knowledge of any of the following languages: Python (course language), R, JAVA, Matlab or any language that features proper machine learning libraries



Schedule

When? What?
25.10.2019 Lecture 1: Introduction & The Data Science Pipeline - Task 1: Release
01.11.2019 No lecture
08.11.2019 Lecture 2: The Python Data Science Stack
15.11.2019 Task 1: Intermediate meeting
22.11.2019 Lecture 3: Advanced Algorithms for Data Science // Task 1 report submission // Task 2: release
29.11.2019 No lecture
06.12.2019 No lecture
13.12.2019 Lecture 4: Evaluation and Tuning of Models // Task 2 report submission // Task 3: release
20.12.2019 No lecture
27.12.2019 No lecture
03.01.2020 No lecture
10.01.2020 Task 3: Intermediate meeting I
17.01.2020 No lecture
24.01.2020 No lecture
31.01.2020 Task 3: Intermediate meeting II
31.01.2020 07.02.2020 Task 3: Presentations (Final Presentation)
05.05.2020 Final Report deadline (Including report and code)