Natural Language Processing

Members

Name				Position
Dr. Matthias Aßenmacher				Lead
Helen Alber				PhD Student
Esteban Garces Arias				(External) Collaborating PhD Student
Matthias Schöffel				(External) Collaborating PhD Student

Alumni

Name				Position				From	To
Stefanie Urchs				(External) Collaborating PhD Student				01/23	01/26
Michael Sawitzki				Student Assistant (DL4NLP Lecture)				04/24	08/25

Research

This group focuses on methodological and applied research in the context of natural language processing (NLP), including (but not limited to) the following topics:

Reproducibility/Comparability/Benchmarking of LLMs
Active Learning for NLP
Decoding Strategies for LLMs
Resources and Evaluation
Bias and Stereotypes
Multi-Modal Deep Learning
Uncertainty quantification

We have ongoing collaborations with the Bavarian Academy of Sciences (M. Schöffel), the MISODA working group at LMU (C. Heumann, E. Garces Arias), and the University of Applied Sciences Munich (V. Thurner, S. Thiemichen, S. Urchs).

Teaching

We are actively developing the Deep Learning for Natural Language Processing (DL4NLP) course together with colleagues from LMU Munich and the University of Vienna.

NLP Colloquium

We regularly organize the “NLP Colloquium” (intended to happen three times a year) where all of our thesis and consulting students present their work to each other. This is an internal event where only current thesis or consulting students are present.
Besides this, we organize interesting (smaller) events in between the regular colloquium slots. This encompasses talks by PhD students on their recent work ot talks by people from other places whom we happen to know. This is open to everyone.
If you are not a thesis/consulting student in our group but are interested in the talks, feel free to reach out, and we will add you to the mailing list.

Students / Thesis supervision

Supervised Theses/Projects (since 01/2022)

Title	Type	Completed
Evaluating Large Language Models for Crosslingual Text Summarization of Historical Documents	MA	2026
Evaluation of AI-generated Business Process Models for Procurement Processes based on Natural Language Input	MA	2026
Benchmarking Large Language Models for Automated Insurance Claims Coverage Review	Consulting	2026
Evaluating LLMs' Adherence to Prompt-Injected Big 5 Personality Traits	BA	2026
Dokumentenklassifikation in der Generali Deutschland Krankenversicherung AG im Bereich Ambulant	Consulting	2026
Consistency of Large Language Models in Multiple-Choice Question Answering: An Empirical Evaluation across Varying Scales	BA	2026
Automated Tender Analysis using LLMs for AI & Data Consulting Opportunities	Consulting	2026
From Miss to Meaning: A Classification and Interpretability Analysis of Retrieval Error in Retrieval-Augmented Generation	MA	2025
Lost in Translation? Exploring the Shift in Grammatical Gender from Latin to Occitan	MA	2025
A Metadata-aware Retrieval Augmented Q&A Application for Academic Texts	MA	2025
Multimodal Semantic Search for Intelligent Product Findability	Consulting	2025
Clustering Disaggregated Conflict Sequences in Africa Based on Complex Spatio-Temporal Data	Consulting	2025
Understanding Temporal Structure in Text - A Study of Positional Encodings in LLMs	MA	2025
Exploring the Potential of Fine-Tuning Language Models for Domain Adaptation in German Radiology Reports	MA	2025
Exploring the Impact of Reinforcement Learning with Rule-Based Rewards on Latent Space Reasoning in Language Models	MA	2025
Derivatives in External Statistics: Assessment of Regulatory Data using Natural Language Processing	MA	2025
Automated Error Detection in Radiology Reports	Consulting	2025
Benchmarking Retrieval-Augmented Generation Systems with Small Language Models on Domain-Specific Data	MA	2025
Machine Learning Methods for Text-Based Fraud Detection	MA	2025
Automated Data Collection of Weapons Endowments of Rebel Organizations	Consulting	2025
Transformer-Based Representation Learning for Human Behavior Modeling From Smartphone Sensing Sequences	MA	2025
Enhancing Supervisory Document Analysis for the Cooperative Banking Sector	Consulting	2025
An Automated Pipeline for the Collection of Data on the Transfer and Export of Small Arms and Light Weapons	Consulting	2025
Combining Large Language Models and Topic Clustering for Metadata-enriched Temporal Evolution Path Detection	MA	2025
Comparing Metrics for the Evaluation of Decoding Strategies on Text Summarization Tasks	BA	2025
Guided Topic Modeling for Customer Feedback Analysis: Incorporating Prior Information and Document-specific Covariates	MA	2025
Enhancing Information Retrieval Via Cognitively Motivated Document Expansion	MA	2024
Übersicht über zertifizierte industrielle KI-Produkte	Consulting	2024
Clustering Embeddings from Large Language Models for Retrospective Event Detection	MA	2024
Synthetic Opinions: Utilizing Large Language Models for Generating Responses to Open-Ended Survey Questions	MA	2024
Text-based geographical assignment of tweets	Consulting	2024
Exploring Hyperparameter Selection Strategies for Topic Clustering with Large Language Models	MA	2024
Exploring Strategies for Informed Initial Pool Selection in Deep Active Learning with Pre-Trained Language Models	MA	2024
NLP in Insurance: Leveraging Language Models to Automatize Disease Classification	Consulting	2024
Automatic transcription of handwritten Franconian using Deep Learning	MA	2024
Advanced Knowledge Editing in Large Language Models	MA	2023
Robust, Explainable, and Unbiased Text Classification of Insurance Claims	MA	2023
Topic Classification of News Headlines	Consulting	2023
Integrating Domain Knowledge into Transformer-based Approaches to Vulnerability Detection	MA	2023
Transformer-Based Language Models for Multiple Choice Question Answering	BA	2023
Interslavic Natural Language Processing	MA	2023
A Comparative Study of Large Language Models for Text-to-Code Generation	BA	2023
ICON: ICD-10 Coding using Natural Language Processing	MA	2023
Quantization in Large Language Models	MA	2023
Natural Language Processing for Systematic Literature Reviews: An Application to Immersive Design Research	Consulting	2023
Digitizing Handwritten Old Occitan Cards using Vision and Language Models	MA	2023
Enhancing stance prediction by utilizing party manifestos	MA	2023
A tailored OCR-System for the Medieval Latin Dictionary for the Bavarian Academy of Sciences and Humanities	Consulting	2023
Application of neural topic models to Twitter data from German politicians	BA	2023
Examining and Mitigating Gender Bias in German Word Embeddings	BA	2023
Domain transfer across country, time, and modality in multiclass classification	BA	2022
How Different is Stereotypical Bias in Different Languages? Analysis of Multilingual Language Models	MA	2022
Predicted Sentiments of Customer Texts as Covariates for Time Series Forecasting	MA	2022
A Comparative Evaluation of the Utility of linguistic Features for Part-of-Speech-Tagging	BA	2022
Evaluating pre-trained language models on partially unlabeled multilingual economic corpora	MA	2022
Leveraging pairwise constraints for topic discovery in weakly annotated text data	MA	2022
Word Embedding Evaluation with Intrinsic Evaluators	MA	2022

A selection of older theses/projects supervised (partly) together with Christian Heumann can be found here.

If you are interested in writing your thesis under our supervision, please include the following information in your e-mail
- your field of interest and at least a tentative idea for the direction of a potential thesis topic
- a CV, and your current transcript of records
- a planned starting date for your thesis (you should also bring some time for developing and refining a research idea, so do not expect to start in e.g. one week)
Disclaimer: Before you apply for a thesis topic regarding NLP make sure that you fit the following profile:
- Willingness and ability to engage in a topic which (potentially) requires a notable amount of self-study, since it is normally not part of the regular curriculum of your studies in statistics.
- Readiness to do quite some programming (most probably in Python)
- Please include the following information in the email mentioned above: Previously experience/attended classes on NLP, deep learning, machine learning, and programming.
- This is not meant to discourage you from writing your thesis on NLP, but rather to get expectations straight in advance.
If you want to apply for supervision of an external thesis, please also include the following information in your email:
- A clear formulation of the thesis goal from an academic perspective of ~1 page (It should not be a pure business case, such projects are better suited for e.g. the Consulting module)
- Information on the external partner, data availability (detailed please), computational resources supplied by the project partner (if applicable)
- Again: Not meant to discourage you or to set any artificial barriers, but to get expectations/goals straight in advance.

Publications

Zehle T, Heiß T, Schlager M, Aßenmacher M, Feurer M (2026) promptolution: A Unified, Modular Framework for Prompt Optimization Accepted to EACL 2026 System Demonstrations, Rabat, Morocco.
link|pdf.
Zehle T, Aßenmacher M (2026) Can Calibration of Positional Encodings Enhance Long Context Utilization? Accepted to EACL Findings 2026, Rabat, Morocco.
Özeren E, Ulbrich A, Filimon S, Rügamer D, Bender A (2026) Enhancing Traffic Accident Classifications: Application of NLP Methods for City Safety. In: In: Dutra I , In: Pechenizkiy M , In: Cortez P , In: Pashami S , In: Pasquali A , In: Moniz N , In: Jorge AM , In: Soares C , In: Abreu PH , In: Gama J (eds) Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track, pp. 180–195. Springer Nature Switzerland, Cham.
link|pdf.
Urchs S, Thurner V, Aßenmacher M, Heumann C, Thiemichen S (2025) Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora Proceedings of the 5th Workshop on Evaluation and Comparison of NLP Systems, pp. 55–65. Association for Computational Linguistics, Mumbai, India.
link|pdf.
Özeren E, Aßenmacher M (2025) Reinforcement Learning for Latent-Space Thinking in LLMs arXiv preprint 2512.11816,
link|pdf.
Amin M, Aßenmacher M (2025) Do Companies Reveal Their Own Fraud? - A Novel Data Set for Fraud Detection Based on 10-K Reports Proceedings of The 10th Workshop on Financial Technology and Natural Language Processing, pp. 148–166. Association for Computational Linguistics, Suzhou, China.
link|pdf.
Gruber C, Alber H, Bischl B, Kauermann G, Plank B, Aßenmacher M (2025) Revisiting Active Learning under (Human) Label Variation Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP, pp. 75–86. Association for Computational Linguistics, Suzhou, China.
link|pdf.
Ding Y, Garces Arias E, Li M, Rodemann J, Aßenmacher M, Chen D, Fan G, Heumann C, Zhang C (2025) GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation Findings of the Association for Computational Linguistics: EMNLP 2025, pp. 7202–7226. Association for Computational Linguistics, Suzhou, China.
link|pdf.
Garces Arias E, Blocher H, Rodemann J, Aßenmacher M, Jansen C (2025) Statistical Multicriteria Evaluation of LLM-Generated Text Proceedings of the 18th International Natural Language Generation Conference, pp. 338–351. Association for Computational Linguistics, Hanoi, Vietnam.
link|pdf.
Urchs S, Thurner V, Aßenmacher M, Bothmann L, Heumann C, Thiemichen S (2025) Are All Genders Equal in the Eyes of Algorithms? – Analysing Search and Retrieval Algorithms for Algorithmic Gender Fairness Proceedings of the 17th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR, pp. 489–500. SciTePress.
link.
Garces Arias E, Blocher H, Rodemann J, Li M, Heumann C, Aßenmacher M (2025) Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM^2), pp. 631–654. Association for Computational Linguistics, Vienna, Austria.
link|pdf.
Stephan A, Zhu D, Aßenmacher M, Shen X, Roth B (2025) From Calculation to Adjudication: Examining LLM Judges on Mathematical Reasoning Tasks Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM^2), pp. 759–773. Association for Computational Linguistics, Vienna, Austria.
link|pdf.
Ma B, Yoztyurk B, Haensch A-C, Wang X, Herklotz M, Kreuter F, Plank B, Aßenmacher M (2025) Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1785–1809. Association for Computational Linguistics, Vienna, Austria.
link|pdf.
Urchs S, Thurner V, Aßenmacher M, Heumann C, Thiemichen S (2025) taz2024full: Analysing German Newspapers for Gender Bias and Discrimination across Decades Findings of the Association for Computational Linguistics: ACL 2025, pp. 10661–10671. Association for Computational Linguistics, Vienna, Austria.
link|pdf.
Debelak R, Koch T, Aßenmacher M, Stachl C (2025) From Embeddings to Explainability: A Tutorial on Transformer-Based Text Analysis for Social and Behavioral Scientists. Advances in Methods and Practices in Psychological Science 8.
link|pdf.
Schöffel M, Garces Arias E, Wiedner M, Ruppert P, Li M, Heumann C, Aßenmacher M (2025) Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages arXiv preprint 2506.17715,
link|pdf.
Rauch L, Wirth M, Huseljic D, Herde M, Sick B, Aßenmacher M (2025) No Free Lunch in Active Learning: LLM Embedding Quality Dictates Query Strategy Success arXiv preprint 2506.01992,
link|pdf.
Zhang C, Wu S, Chen Y, Aßenmacher M, Heumann C, Men Y, Fan G, Gama J (2025) OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery arXiv preprint 2505.03836,
link|pdf.
Schöffel M, Wiedner M, Garces Arias E, Ruppert P, Heumann C, Aßenmacher M (2025) Modern Models, Medieval Texts: A POS Tagging Study of Old Occitan Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pp. 334–349. Association for Computational Linguistics, Albuquerque, USA.
link|pdf.
Wuttke A, Aßenmacher M, Klamm C, Lang MM, Würschinger Q, Kreuter F (2025) AI Conversational Interviewing: Transforming Surveys with LLMs as Adaptive Interviewers Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025), pp. 179–204. Association for Computational Linguistics, Albuquerque, New Mexico.
link|pdf.
Mironov M, Marquard A, Racek D, Heumann C, Thurner PW, Aßenmacher M (2025) A Geoparsing Pipeline for Multilingual Social Media Posts from Ukraine Proceedings of The GeoExT 2025: Geographic Information Extraction from Texts Workshop co-located with The 47th European Conference on Information Retrieval (ECIR), Lucca, Italy.
link|pdf.
Garces Arias E, Li M, Heumann C, Assenmacher M (2025) Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation Proceedings of the 31st International Conference on Computational Linguistics, pp. 9992–10020. Association for Computational Linguistics, Abu Dhabi, UAE.
link|pdf.
Garces Arias E, Rodemann J, Li M, Heumann C, Aßenmacher M (2024) Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation Findings of the Association for Computational Linguistics: EMNLP 2024, pp. 15060–15080. Association for Computational Linguistics, Miami, Florida, USA.
link|pdf.
Aßenmacher M, Karrlein L, Schiele P, Heumann C (2024) Introducing wwm-german-18k - Can LLMs Crack the Million? (Or Win at Least 500 Euros?) Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), pp. 287–296. Association for Computational Linguistics, Trento.
link|pdf.
Urchs S, Thurner V, Aßenmacher M, Heumann C, Thiemichen S (2024) Detecting Gender Discrimination on Actor Level Using Linguistic Discourse Analysis Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pp. 140–149. Association for Computational Linguistics, Bangkok, Thailand.
link|pdf.
Aßenmacher M, Stephan A, Weissweiler L, Çano E, Ziegler I, Härttrich M, Bischl B, Roth B, Heumann C, Schütze H (2024) Collaborative Development of Modular Open Source Educational Resources for Natural Language Processing Proceedings of the Sixth Workshop on Teaching NLP, pp. 43–53. Association for Computational Linguistics, Bangkok, Thailand.
link|pdf.
Pavlopoulos J, Kougia V, Garces Arias E, Platanou P, Shabalin S, Liagkou K, Papadatos E, Essler H, Camps J-B, Fischer F (2024) Challenging Error Correction in Recognised Byzantine Greek Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024), pp. 1–12. Association for Computational Linguistics, Bangkok, Thailand.
link|pdf.
Mittermeier A, Aßenmacher M, Schachtner B, Grosu S, Dakovic V, Kandratovich V, Sabel B, Ingrisch M (2024) Automatische ICD-10-Codierung. Die Radiologie, 1–7.
link.
Deiseroth B, Meuer M, Gritsch N, Eichenberg C, Schramowski P, Aßenmacher M, Kersting K (2024) Divergent Token Metrics: Measuring degradation to prune away LLM components – and optimize quantization Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pp. 6764–6783. Association for Computational Linguistics, Mexico City, Mexico.
link|pdf.
Mayer L, Heumann C, Aßenmacher M (2024) Can OpenSource beat ChatGPT? - A Comparative Study of Large Language Models for Text-to-Code Generation Proceedings of the 9th edition of the Swiss Text Analytics Conference, pp. 1–20. Association for Computational Linguistics, Chur, Switzerland.
link|pdf.
Aßenmacher M, Sauter N, Heumann C (2024) Classifying multilingual party manifestos: Domain transfer across country, time, and genre Proceedings of the 9th edition of the Swiss Text Analytics Conference, pp. 21–31. Association for Computational Linguistics, Chur, Switzerland.
link|pdf.
Gruber C, Hechinger K, Aßenmacher M, Kauermann G, Plank B (2024) More Labels or Cases? Assessing Label Variation in Natural Language Inference Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language, pp. 22–32. Association for Computational Linguistics, Malta.
link|pdf.
Garces Arias E, Pai V, Schöffel M, Heumann C, Aßenmacher M (2023) Automatic Transcription of Handwritten Old Occitan Language Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp. 15416–15439. Association for Computational Linguistics, Singapore.
link|pdf.
Öztürk IT, Nedelchev R, Heumann C, Garces Arias E, Roger M, Bischl B, Aßenmacher M (2023) How Different Is Stereotypical Bias Across Languages? 3rd Workshop on Bias and Fairness in AI (co-located with ECML-PKDD 2023),
link|pdf.
Witte M, Schwenzow J, Heitmann M, Reisenbichler M, Aßenmacher M (2023) Potential for Decision Aids based on Natural Language Processing Proceedings of the European Marketing Academy, 52nd, (114322),
link|pdf.
Aßenmacher M, Rauch L, Goschenhofer J, Stephan A, Bischl B, Roth B, Sick B (2023) Towards Enhancing Deep Active Learning with Weak Supervision and Constrained Clustering Proceedings of the 7th Workshop on Interactive Adaptive Learning (co-located with ECML-PKDD 2023),
link|pdf.
Akkus C, Chu L, Djakovic V, Jauch-Walser S, Koch P, Loss G, Marquardt C, Moldovan M, Sauter N, Schneider M, Schulte R, Urbanczyk K, Goschenhofer J, Heumann C, Hvingelby R, Schalk D, Aßenmacher M (2023) Multimodal Deep Learning. arXiv preprint arXiv:2301.04856.
link|pdf.
Koch P, Nuñez GV, Garces Arias E, Heumann C, Schöffel M, Häberlin A, Aßenmacher M (2023) A tailored Handwritten-Text-Recognition System for Medieval Latin First Workshop on Ancient Language Processing (ALP 2023),
link|pdf.
Rauch L, Aßenmacher M, Huseljic D, Wirth M, Bischl B, Sick B (2023) ActiveGLAE: A Benchmark for Deep Active Learning with Transformers Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023,
link|pdf.
Schulze P, Wiegrebe S, Thurner PW, Heumann C, Aßenmacher M (2023) A Bayesian approach to modeling topic-metadata relationships. AStA Advances in Statistical Analysis 108, 333–349.
link.
Urchs S, Thurner V, Aßenmacher M, Heumann C, Thiemichen S (2023) How Prevalent is Gender Bias in ChatGPT? - Exploring German and English ChatGPT Responses 1st Workshop on Biased Data in Conversational Agents (co-located with ECML-PKDD 2023),
link|pdf.
Aßenmacher M, Dietrich M, Elmaklizi A, Hemauer EM, Wagenknecht N (2022) Whitepaper: New Tools for Old Problems.
link.
Koch P, Aßenmacher M, Heumann C (2022) Pre-trained language models evaluating themselves - A comparative study Proceedings of the Third Workshop on Insights from Negative Results in NLP, pp. 180–187. Association for Computational Linguistics, Dublin, Ireland.
link|pdf.
Lebmeier E, Aßenmacher M, Heumann C (2022) On the current state of reproducibility and reporting of uncertainty for Aspect-based Sentiment Analysis Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), Springer International Publishing, Grenoble, France.
pdf.
Goschenhofer J, Ragupathy P, Heumann C, Bischl B, Aßenmacher M (2022) CC-Top: Constrained Clustering for Dynamic Topic Discovery Workshop on Ever Evolving NLP (EvoNLP), Association for Computational Linguistics, Abu Dhabi, United Arab Emirates.
link|pdf.