Course syllabus

Course-PM

DAT566 / DIT408 - Introduction to Data Science and AI - LP1 HT25 (7.5 hp)

Course is offered by the Department of Computer Science and Engineering.

 

Contact details

Lecturer/Examiner: Assist. Professor Rocío Mercado Oropeza <rocio.mercado@chalmers.se>
Lecturer: Assist. Professor Stefano Sarao Mannelli  <s.saraomannelli@chalmers.se>
Lecturer: Dr. Oana Geman <geman@chalmers.se>
Lecturer: Dr. Sandro Stucki <sandro.stucki@chalmers.se>
Lecturer: Dr. Bastiaan Bruinsma <jacobuss@chalmers.se>
Lecturer: Dr. Farzaneh Jalalypour <farjal@chalmers.se>
Responsible for grading and TAs:  Chenxiao Ma <mache@chalmers.se>

Additional contact information is available on the Home page for student TAs and course representatives.

 

Course purpose

The course gives a broad introduction to various techniques and theories used in Data Science and AI, with particular focus on their practical applications.

 

Schedule

The course includes 2 lectures (hybrid: in-person HA4 + Zoom), as well as 2 laboratory sessions (hybrid), each week.

The schedule for lectures and sessions varies due to holidays. For the latest information, please see: TimeEdit

For an overview of all lectures, lab sessions, and responsible Teaching Assistants, please see: Front page

 

Assignment/quiz schedule

Assignment/quiz #

Topic

Released

Deadline

Re-submission deadline

Module 1 Assignment

Weather

Mon 01 Sep 25 08:00

Mon 08 Sep 25 23:59

Mon 22 Sep 25 23:59

Module 2 Assignment

Apartments

Mon 08 Sep 25 08:00

Mon 15 Sep 25 23:59

Mon 29 Sep 25 23:59

Module 3 Assignment

Diabetes

Mon 15 Sep 25 08:00

Mon 22 Sep 25 23:59

Mon 06 Oct 25 23:59

Modules 1-4 Quiz

Quiz 1

Mon 22 Sep 25 08:00

Mon 29 Sep 25 23:59

Mon 06 Oct 25 23:59

Module 5 Assignment

Seeds

Mon 29 Sep 25 08:00

Mon 06 Oct 25 23:59

Mon 20 Oct 25 23:59

Module 6 Assignment

Fashion MNIST

Mon 06 Oct 25 08:00

Mon 13 Oct 25 23:59

Mon 27 Oct 25 23:59

Module 7 Assignment

RAG with Ollama

Mon 13 Oct 25 08:00

Mon 20 Oct 25 23:59

Mon 03 Nov 25 23:59

Modules 5-8 Quiz

Quiz 2

Mon 20 Oct 25 08:00

Mon 27 Oct 25 23:59

Mon 03 Nov 25 23:59

All times/dates are in Gothenburg, SE time. 

 

The final exam will be held on Tues 28 Oct 25 in the morning and assess your mastery of the course content spanning all 8 modules. Logistical details on the exam (room number, time) will come later and be updated here.

 

Lab sessions

Lab sessions will be held hybrid, with TAs available to help you both over Zoom, and in ED3354 and ED3358.

For the lab sessions on Zoom, this Wagly is used as waiting list.

When you create a help request, first create a Zoom room and put the room id (the 10 or 11 digit number) as your name. A TA will then join the room. Do not enable a password for the room.

 

Course literature

Skiena, Steven S. (2017). The Data Science Design Manual. Springer.

Available through Chalmers Network and Library at https://link.springer.com/book/10.1007/978-3-319-55444-0

Additional literature can be found in the respective modules.

 

Course design

There is no mandatory attendance at either the Lectures or Laboratory sessions. Come if you see value in it and would like to interact with your lecturers and TAs, who put a lot of work into preparing interesting and engaging course material for you.

Laboratory sessions consist of independent work on the assignments and there will be course staff available to help. The teaching assistants will primarily answer questions related to the assignment of the week, and you are assumed to have attended the lectures and done the weekly readings. General questions about how to install Python (and similar) will only be answered if there is time. There are plenty of resources for this online, please see material for weeks 1-2 of the course.

Lectures are to be held hybrid, in-person in HA4 and over Zoom. There is required reading attached for each lecture which you are expected to read before the lecture. Lectures will assume you have read the literature and will not repeat it.

Examination form

The course consists of two assessment components, each of which must be passed in order to pass the course. The course is graded pass/fail (G/U).

1. Written assignments and quizzes:

  • The course has 6 mandatory weekly assignments and 2 quizzes (4.0 credits), all of which must be passed to pass this module.
  • An assignment/quiz is considered a pass if you have obtained at least 70% of the maximum score.
  • All assignments are to be done and submitted in groups of two students and you need to select your group within CodeGrade.
    • Exceptions may be requested from the Examiner in the first two weeks of the course only, but will only be granted under exceptional circumstances.
    • Assignments are released on the first day of each module, with the initial deadline one week AFTER initial release.
    • Most assignments are returned as Jupyter notebooks on CodeGrade. Some parts of the assignment will be automatically graded, which means you will get immediate feedback for those parts. You will receive feedback and grading one week after the initial deadline for all assignments for the manually graded component.
    • Re-submissions do not receive feedback, only grading.
  • Quizzes are to be done and submitted individually. No exceptions.
    • The quizzes are multiple-choice Canvas quizzes. You have a maximum of 2 attempts for completing the quiz with a passing grade (the first submission, and the re-submission).
    • There is a 30-minute time limit for the Canvas quizzes.
    • For quizzes, automated item-level feedback may appear per Canvas settings after each attempt. You may also come to lab sessions to ask questions before using up your second attempt.
    • If you fail the quiz by the first submission deadline, you have one more week to try again with your second attempt.
  • If you do not pass each assignment/quiz, you can resubmit once. The re-submission deadline is 1 week after you receive your grades and listed above in the Syllabus. After this, the assignment/quiz is closed.
  • All deadlines are hard. Late submissions are considered re-submissions (and thus receive no manual feedback).
  • If you fail the re-submission, you will need to complete that assignment/quiz again in a later instance of the course (e.g., LP2).
  • Extensions are only given for valid reasons such as illness, serious family issues, etc.not for holidays or university trips.
    • To request an extension, you are required to email the Examiner BEFORE the initial deadline (noon before assignment is due at latest).
    • Do not assume that if you request an extension, that it will be granted.
  • The use of generative AI tools (e.g., ChatGPT, DeepSeek, Claude, co-pilots) in carrying out assignments, unless where specifically instructed to use them (e.g., Module 7 Assignment: RAG with Ollama), is strictly prohibited and is considered plagiarism in this course. The use of generative AI to write-up assignments is also forbidden and considered plagiarism.

2. Hall exam:

  • The course has 1 mandatory digital hall exam (3.5 credits). The exam is graded pass/fail.
  • The exam will be multiple choice and considered a pass if you obtain at least 70% of the maximum score.
  • Exam date: Tuesday October 28, 2025 in the morning (4 h).
    • If you cannot make the above exam date, or if you fail the exam, then you may take the exam in another course instance.
  • The examination will be conducted as a digital examination via Inspera. It will be a multiple-choice exam and automatically graded.
  • No aids will be permitted during the examination (e.g., no markings, index cards, or notes).
  • The use of generative AI tools (e.g., ChatGPT, DeepSeek, Claude, co-pilots) in the exam is strictly prohibited and is considered plagiarism in this course.

Students with approved accommodations should contact the examiner and the examiner will coordinate suitable arrangements.

 

Plagiarism policy

You are not allowed to copy pieces of code from students of other groups. You may discuss the problems, but you may not share code.

You may not publish your solutions. Do not put your code into a public GitHub repository, for example.

If you use materials you find in the Internet (e.g., Wikipedia, Stack Overflow, Reddit discussions), you must attribute the source. Finding matching code snippets without proper attribution means you are presenting others' work as your own, and is considered plagiarism.

The use of generative AI tools (e.g., ChatGPT, DeepSeek, Claude, co-pilots, etc.) in preparing assignments/quizzes is considered plagiarism.

Cases where plagiarism is spotted will be deferred to the disciplinary committee of the university and may lead to suspension.

 

Changes since last time

Since the last learning period (LP4 VT25), the course has introduced a written hall examination worth 3.5 credits. 

 

Learning objectives

On successful completion of the course the student will be able to:

Knowledge and understanding

  • describe fundamental types of problems and main approaches in data science and AI
  • give examples of data science and AI applications from different contexts
  • give examples of how stochastic models and machine learning (ML) are applied in data science and AI
  • explain basic concepts in classical AI, and the relationship between logical and data driven, ML-based approaches within AI
  • briefly explain the historical development of AI, what is possible today and discuss possible future development.

Skills and abilities

  • use appropriate programming libraries and techniques to implement basic transformations, visualizations and analyses of example data
  • identify appropriate types of analysis problems for some concrete data science applications
  • implement some types of stochastic models and apply them in data science and AI applications
  • implement and/or use AI-tools for search, planning and problem solving
  • apply simple machine learning methods implemented in a standard library

Judgement and approach

  • justify which type of statistical method is applicable for the most common types of experiments in data science applications
  • discuss advantages and drawbacks of different types of approaches and models within data science and AI.
  • reflect on inherent limitations of data science methods and how the misuse of statistical techniques can lead to dubious conclusions
  • critically analyze and discuss data science and AI applications with respect to ethics, privacy and societal impact
  • show a reflective attitude in all learning

 

Re-examination instructions

Students re-taking the course from a previous failed instance of the course should consult the instructions for re-examination.

 

Link to the syllabus on Studieportalen: DAT566, DIT408