Zurich
Master's Thesis
- Publication date: 05 November 2025
- Workload: 100%
- Place of work: Zurich
About the job
Open, Composable Data Platforms
Ref. 2025_031
About the Roles
Our team at IBM Research in Zürich is shaping the future of data processing for AI systems, creating self-optimizing platforms that deliver exceptional cost/performance and are effortless to use. We work at the intersection of research and real-world impact: developing new methods, implementing them in widely used open-source projects, and supporting product teams in bringing the latest innovations to our clients.
We have several positions open for Master's theses or internships on the following topics:
- Defining and extending a cross-engine serialization format for query execution plans that covers all logical data compute operations for all data modalities (structured, unstructured) and execution modes (batch, streaming) in the Substrait open-source project
- Contributing to Substrait SDKs for different programming languages (Java, Python, Rust, Go, C++, etc.) that are easy for humans and AI agents to use, well tested, documented, and compliant with the Substrait specification
- Creating a rule- and/or cost-based query plan optimization framework that can natively optimize Substrait plans
- Extending the set of engines that can execute Substrait plans, either by creating adapters (e.g. a Substrait-to-PGSQL dialect adapter) or by adding native Substrait support to an engine (e.g. via a PostgreSQL extension)
- Creating a visualization tool for Substrait plans that helps users reason about them
What we are looking for
Motivated students who are interested in data processing systems. You approach problems holistically, analyzing and addressing issues across the entire technology stack.
If you’re passionate about contributing to the future of open, composable data platforms and want to see your work in the hands of real users, we’d love to hear from you.
Minimum Qualifications
- Bachelor’s degree in Databases or Systems, or a closely related field
- Strong programming skills
- Strong interest in building reusable components that are used by other developers as part of an Open-Source community
- Proficiency in Java, Python, C++, or JavaScript (depending on the topic)
- Excellent communication and presentation skills in English
- Team-oriented, self-motivated, and able to solve problems independently
Preferred Qualifications
- Experience with data management systems
- Experience in building database optimizers and data-processing systems
- Experience with contributions to open-source projects in the data processing space (e.g. Substrait, Arrow, PostgreSQL, Presto, Velox)
- Ability to work independently and adapt in a fast-paced, evolving research environment
Diversity
IBM is committed to diversity in the workplace. With us you will find an open, multicultural environment. Excellent flexible working arrangements enable all genders to strike the desired balance between their professional development and their personal lives.
How to apply
Please submit your application with a cover letter and curriculum vitae through the link below.