Olga Ovcharenko is a Data Science master’s student at ETH Zurich, Switzerland. Olga’s interest in data science started with her undergraduate Data Management course at Graz University of Technology, Austria. She mainly focused on distributed storage, analysis, and stream processing. Olga was part of the DAMSLab headed and supervised by Prof. Matthias Boehm. Olga had been working on the ExDRa project (exploratory data science over raw data) and contributing to the federated backend of Apache SystemDS, an open-source ML system for the end-to-end data science lifecycle. She implemented federated components such as linear algebra operations and low-level instructions. Olga is Apache SystemDS PMC member and has coauthored joint SIGMOD and CIKM papers with Siemens, TU Berlin, DFKI, and TU Graz. For her bachelor’s thesis, Olga built a distributed data generator to scale real datasets preserving statistics and error distribution of the original data. Additionally, Olga worked with Prof. Theodoros (Theo) Rekatsinas, Prof. Valentina Boeva.
Publications
2024
Philip Toma, Olga Ovcharenko, Imant Daunhawer, Julia E. Vogt, Florian Barkmann, Valentina Boeva: Benchmarking Self-Supervised Learning for Single-Cell Data [paper]. NeurIPS SSL Workshop 2024.
Olga Ovcharenko, Rita Sevastjanova, Valentina Boeva: FeatureClock: High-Dimensional Effects in Two-Dimensional Plots [library] [paper]. IEEE VIS 2024.
2022
Sebastian Baunsgaard, Matthias Boehm, Kevin Innerebner, Mito Kehayov, Florian Lackner, Olga Ovcharenko, Arnab Phani, Tobias Rieger, David Weissteiner and Sebastian Benjamin Wrede: Federated Data Preparation, Learning, and Debugging in Apache SystemDS [paper]. CIKM 2022.
2021
Sebastian Baunsgaard, Matthias Boehm, Ankit Chaudhary, Behrouz Derakhshan, Stefan Geißelsöder, Philipp M. Grulich, Michael Hildebrand, Kevin Innerebner, Volker Markl, Claus Neubauer, Sarah Osterburg, Olga Ovcharenko, Sergey Redyuk, Tobias Rieger, Alireza Rezaei Mahdiraji, Sebastian Benjamin Wrede, Steffen Zeuch: ExDRa: Exploratory Data Science on Federated Raw Data [paper]. SIGMOD 2021.
Work Experience
- Teaching Assistant (Data Modeling and Databases) - ETH Zurich. Feb - Aug 2023
- Teaching Assistant (Quality Assurance in Software Development) - TU Graz. Feb - Jul 2022
- Undergraduate Research Assistant - TU Graz. Jul 2020 - Aug 2022
- IT Intern - AIT. Feb - Aug 2020
- Teaching Assistant (Data Management) - TU Graz. Oct 2019 - Jun 2020
Education
- Master of Science, Data Science at ETH Zurich, 2022 - present
- Bachelor of Science, Computer Science at Graz University of Technology, 2019 - 2022
- Thesis: Large-scale Data Generation for Benchmarking Data Cleaning Tools [thesis]