Intern
Zentrum für Philologie und Digitalität "Kallimachos"

MA 45: Double Major

First Semester

  • Lecture: Deep Learning (Katharina Breininger), see WueStudy
  • Exercise (Katharina Breininger), see WueStudy
  • Tutorium (Christof Weiß)

The lecture provides advanced knowledge of deep learning techniques such as FCN, CNN and LSTMs, practical application examples for NN architectures, e.g. in the field of image and speech processing. Current models and methods of machine learning and their technical background are presented. Building on this, models from the field of deep learning, such as CNNs, RNNs and sequence-to-sequence architectures, are discussed. The theoretical foundations of these models, such as training through backpropagation, are also discussed in detail. For all the models covered, it is shown how they are used in practice for specific problems such as image processing and text generation.

Goals

Students have knowledge of the possible applications and limitations of deep learning, of important architectures and how they are implemented in typical tools, of the ability to reprogram network structures from the literature, of data preparation and of solving concrete tasks.

The course teaches the skills needed for the systematic analysis of written cultural data, e.g., literary texts or texts from social media. This includes the following tasks: Formulating a research hypothesis based on existing research and developing a research design to test it, automated extraction of specific text features including evaluation of the extraction method, and statistical analysis of the data.

Goals

Students are able to independently implement at least one typical research design in CH, make informed decisions about the extraction and analysis methods to be used, and implement them technically.

Second Semester

  • Languages of the world: language families, typology, etymology.
  • Linguistic universals: words, morphology, partsof-speech, syntax. Alphabets (scripts), encoding, and language identification.
  • Multilingual word representation spaces (aka cross-lingual word embeddings).
  • Transformer architecture and Pretrained (multilingual) Language Models.
  • Machine translation.
  • Multilingual resources: unlabeled corpora, lexico-semantic networks and word translations, parallel corpora.
  • Cross-lingual transfer: from word alignment and label projection, over MT-based transfer to zero-shot and few-shot transfer with multilingual Transformer-based language models.
  • Advanced topics: curse of multilinguality, modularization and language adaptation, multilingual sentence encoders, contextual parameter generation, multi-source transfer, gradient manipulations.

Goals

Students will acquire theoretical and practical knowledge on modern multilingual natural language processing and also get an insight into cutting edge research in (multilingual) NLP. They will learn how to represent texts from different languages in shared representation spaces that enable semantic comparison and cross-lingual transfer for various NLP tasks. Upon successful completion of the course, the students will be well-equipped to solve practical NLP problems regardless of the language of the text data, and to determine the optimal strategy to obtain best performance for any concrete target language.

Format

Lecture with Exercise and Tutorium.

Courses in WueStudy include:

Multilingual Natural Language Processing (Goran Glavaš)

Machine Learning for Natural Language Processing (Andreas Hotho)

Processing and discussion of exemplary research questions using computational humanities methods, with a focus on corpus analysis of non-textual data such as audio, music, image, video, or 3D data.

Goals

Students are able to answer research questions in computational humanities and to carry out and evaluate corpus analyses of non-textual data.

 

Third Semester

The research project gives students the opportunity to independently apply what they have learned so far to a topic of their own choosing. Ideally, they should work on a research question from the formulation of the research hypothesis to data collection and analysis, or at least complete a significant step in the process.

Goals

Students are able to work on a problem in the CH, develop procedures for solving it, implement these in appropriate steps, and present the results.

 

additionally, choose 5 ECTS from the electives listed below.

Fourth Semester

In the 4th semester, you should write your master’s thesis and finally present it in the colloquium.

With a double major, you are free to decide in which of your majors you would like to write your thesis (or whether you find a subject that fits both subjects).

Mandatory Electives

Advanced techniques for the analysis of multimodal data (e.g. audio/music processing, image processing) using machine learning methods. Discussion and evaluation of such methods in the context of the computational humanities.

Goals

Students have a fundamental understanding of the respective data types as well as theoretical and practical knowledge in the field of multimedia processing. They have gained experience with typical tasks and are able to understand, apply, further develop and evaluate the algorithms.

Lectures

Music Information Retrieval (Christof Weiß) (Exercise)

There will be additional offers in the future, e.g., from the area of Computer Vision.

If you are studying the single major (120 ECTS) and visit two of the alternatives, the second course can be accounted for as mandatory elective.

The research project gives students the opportunity to independently apply what they have learned so far to a topic of their own choosing. Ideally, they should work on a research question from the formulation of the research hypothesis to data collection and analysis, or at least complete a significant step in the process.

Goals

Students are able to work on a problem in the CH, develop procedures for solving it, implement these in appropriate steps, and present the results.

 

TODO

Cultural data and research data from the cultural sciences and humanities often pose special challenges in terms of indexing, management and preservation. The data should often be usable for a long period of time and be available for very different applications, if possible also for scenarios that were not considered when the data was created. The seminar teaches relevant principles and techniques. 

Objectives

Students understand the challenges of cultural data management, can model cultural data and design and implement techniques for its management.

S (2) • 5 ECTS • NUM • Faculty of Arts, Historical, Philological, Cultural and Geographical Studies

 

Digital editions make historical documents accessible and prepare them for the questions of a scientific audience. The module teaches the principles and techniques of designing, editing and presenting digital editions.

Objectives

Students understand the functions and characteristics of digital editions and can independently take on roles in the conception, presentation or preparation of digital editions.

S (2) • 5 ECTS • NUM • Faculty of Arts, Historical, Philological, Cultural and Geographical Studies
 

Data annotation, i.e., the linking of concepts from the humanities and cultural studies with cultural data, is an essential tool for the development and evaluation of automatic processes in the CH. The seminar teaches the relevant work process from the development of annotation guidelines to their technical implementation in an annotation environment, the training of annotators, and the calculation of measures of inter-annotator agreement.

Goals

Students can independently develop an annotation and implement it themselves or supervise its implementation.

Due to their close links to computer science and AI, CH are developing particularly rapidly, which makes it especially important to keep knowledge up to date. Current research trends are discussed using a selected example, e.g., the development of a new form of information representation, information extraction, or data analysis.

Goals

Insight into current research on a selected topic of CH. Acquisition of the competence to compile and understand current research on a selected topic.

New research methods for the computational humanities.

Goals

Students have specialized knowledge of new research methods in computational humanities. They can understand, apply and evaluate these methods.