Analysis and classification of German media articles content (text and images) using ML/AI and conventional text processing
- When: Apply until 31.08.2023.
- How to apply: Send us an e-mail (at the end of this page) with your documents.
💡 Background
The GETT (GenderEqualityTrackingTool) aims to continuously monitor the presence of scientists in media and quantify their appearance and analyze how they are presented. Therefore, we require different sources to be integrated into the software based on a list of German online media outlets.
The aim of the project is to implement analysis tools to classify the content of the articles regarding topics, mentioned and quoted personas.
In further projects the analyzed data will be used to compare the representation of scientists in the media between different outlets and track possible changes over time.
🦾Who We Are
yathos is a software development and consulting company with a focus on tailor made software for research and businesses. We aim to provide reliant and low maintenance software products. This ensures the future success of our customers. We provide the full service from consulting, project management, implementation, and operation of software.
🎯 Goals
Build processes that use text-based input and classify the data based on given classes. Design processes that identify persons in given texts and images. Further implement processes that gather how persons are described in texts and displayed in images. Use the gathered information to classify the display of people in the texts.
🎓 Profile
- Skills using/ applying:
- AI Language Models
- Classification algorithms (Nearest Neighbor, Naïve Bayes, SVM, …)
- Sentiment analysis
- Python coding experience
- Bonus: experience with Java/JavaEE, Docker
📄 Deliverables
The students must provide the developed source code, enabling GETT to use the result and alter the code if needed.
A documentation of the code must be present inline. A separate short documentation of the developed functionality is to be created.
📝 How to Apply
If you are interested, please contact Maxi Görnitz (maxi.goernitz@tum.de) by submitting the following documents until 15.06.2023 in one PDF:
- Grade report
- Short overview of your experience in software development, including a list of coding languages and technologies in which you already have knowledge
- Short experience report with AI: What tools have already been used, what prior knowledge do you have?