Sebastian Schelter

Web Name: Sebastian Schelter

WebSite: http://www.ssc.io

ID:335171

Keywords:

Sebastian,Schelter

Description:


Sebastian Schelter

Assistant Professor

University of Amsterdam

I am an Assistant Professor with the University of Amsterdam, conducting research at the intersection of data management and machine learning. I am affiliated with the Intelligent Data Engineering Lab, manage the AIforRetailLab and have a joint appointment as research fellow at Ahold Delhaize.

My work addresses data-related problems that occur in the real world application of machine learning. Examples are the automation of data quality validation, the inspection of machine learning pipelines via code instrumentation, or the design of machine learning applications that can efficiently forget data.

Most of my research is accompanied by efficient and scalable open source implementations, many of which are applied in real world use cases, for example in the Amazon SageMaker Model Monitor service, the product recommendation system at bol.com and large-scale recommendation libraries in cloud environments such as Amazon Web Services and Microsoft Azure.

In the past, I have been a Faculty Fellow with the Center for Data Science at NewYorkUniversity and a Senior Applied Scientist at Amazon Research, after obtaining my Ph.D. at the database group of TU Berlin with Volker Markl. I am active in open source as an elected member of the Apache Software Foundation, and have extensive experience in building real world systems from my time at Amazon, Twitter, IBM Research, and Zalando.

News

Previous News

Aug 18, 2022 ‐ I gave an invited talk on Data Provenance as a Foundation for AI Governance at Megagon Labs.

Jul 22, 2022 ‐ The Informatics Institute of the UvA has published an interview with me on making data management responsible.

May 21, 2022 ‐ Julia Stoyanovich, Serge Abiteboul, Bill Howe, H.V. Jagadish and me have published an article on Responsible Data Management in the Communications of the ACM. We discuss perspectives on the role and responsibility of the data management research community in designing, developing, using and overseeing automated decision making systems.

May 6, 2022 ‐ I have been invited to a panel on Career Paths after Ph.D. – Perspectives from Senior and Junior Researchers at the PhD Symposium of ICDE’22.

Apr 20, 2022 ‐ I have been the editor for a Special Issue on Directions Towards GDPR-Compliant Data Systems and Applications of the IEEE Data Engineering Bulletin.

Recent Publications

All Publications

(2022). Automatic Oblivion. Sustain (Sustainable AI in Practice), Issue 1.

PDF

(2022). DORIAN in action: Assisted Design of Data Science Pipelines. VLDB(demo).

(2022). Responsible Data Management. Communications of the ACM.

PDF

(2022). Letter from the Special Issue Editor. Special issue on “Directions Towards GDPR-Compliant Data Systems and Applications” of the IEEE Data Engineering Bulletin (Vol 45, Issue 1).

PDF

(2022). Towards Data-Centric What-If Analysis for Native Machine Learning Pipelines . Data Management for End-to-End Machine Learning workshop at ACM SIGMOD.

PDF

(2022). ReCANet: A Repeat Consumption-Aware Neural Network for Next Basket Recommendation in Grocery Shopping. ACM SIGIR.

PDF

(2022). GitSchemas: A Schema Dataset for Automating Relational Data Preparation Tasks. Databases for Machine Learning workshop at ICDE.

PDF

(2022). Serving Low-Latency Session-Based Recommendations at bol.com. ECIR (industry talk).

PDF

Team

PhD Students
Mozhdeh Ariannezhad
(with Maarten de Rijke) Olivier Sprangers
(with Maarten de Rijke) Arezoo Sarvi
(with Maarten de Rijke) Barrie Kersbergen
(with Maarten de Rijke) Stefan Grafberger
(with Paul Groth) Zeyu Zhang
(with Iacer Calixto)
Researchers & Guests
Shubha Guha
Till Doehmen
Benjamin Wang

Collaborations

I am collaborating with Prof. Julia Stoyanovich from New York University on research with regard to the impact of data preprocessing on the fairness of machine-assisted decision making.

I am working with Hannes Muehleisen from Centrum Wiskunde & Informatica (CWI) on leveraging DuckDB for the efficient execution of data preprocessing in ML pipelines.

I am working with Iacer Calixto from the University of Amsterdam on problems at the intersection of responsible data management and natural language processing.

With Prof. Felix Biessmann from Beuth University Berlin, I am conducting research on data validation and data cleaning for machine learning.

I am an associated researcher with BIFOLD, the Berlin Institute for the Foundations of Learning and Data.

CV

Scientific Career

Before joining University of Amsterdam, I have been a Faculty Fellow at the Center for Data Science at New York University, and a Senior Applied Scientist at Amazon Research in Berlin, where I worked on data management-related issues of machine learning applications, such as demand forecasting, metadata and provenance tracking of machine learning pipelines and automating data quality verification.

I received my Ph.D. with “summa cum laude” from TU Berlin in 2015, where I have been advised by Volker Markl, head of the database systems and information management group. My co-supervisors were Klaus-Robert Müller from the machine learning group at TUBerlin and Reza Zadeh from Stanford. During my studies, I have been interning with the SystemML group at IBM Research Almaden and the social recommendations team at Twitter in California.

Open Source

I am engaged in open source as an elected member of the Apache Software Foundation since 2012. I have been involved in the Apache Mahout, Apache Flink, Apache Giraph and the incubation of the Apache MXNet and Apache TVM projects. Besides that I co-created Deequ, a library for ‘unit-testing’ large datasets with Apache Spark, and Serenade, a low-latency session-based recommender system deployed in production at a large Dutch retailer. Furthermore, I am a member of the Electronic Frontier Foundation since 2015.

Scientific Service

I am the founder and have chaired the workshop series on Data Management for End-To-End Machine Learning (DEEM) at ACM SIGMOD from 2017 to 2020, and an Action Editor for the ML Open Source Software track of the Journal of Machine Learning Research. I have served as Associate Editor for PVLDB Volume 15, as the editor for two special issues of the IEEE Data Engineering Bulletin in 2021 and 2022, and as co-chair of the industry and applications track of EDBT 2022.

I regularly review submissions to top tier data management conferences. I have been on the program committee at SIGMOD 2017, 2019-2023, VLDB 2021, ICDE2018-2021, EDBT 2017 & 2021, CIKM’20, the PhD Symposium at VLDB’21, the workshop on Exploiting Artificial Intelligence Techniques for Data Management at SIGMOD 2019, the Large-Scale Recommender Systems workshop at the ACMRecSys 2013-2015, the workshop on Applied AI for Database Systems and Applications at VLDB’20, on Table Representation Learning at NeurIPS’22 and Provenance Week’20. Additionally, I have reviewed submissions to journals for IEEETKDE, ACMTIST, IEEETPDS, IEEETNNLS, VLDBJournal, the VLDB Journal Special Issue on Data Science for Responsible Data Management, the journal track of ECML/PKDD and the open source track of JMLR. I am also part of the review board of the Journal of Systems Research (JSys), and have been a reviewer for the Amazon Research Awards.

At the University of Amsterdam, I coordinate the honors program for the bachelorAI and teach a course on Big Data engineering with more than 190 students.

Contact

I’m reachable via email at s.schelter[at]uva.nl. I’m also very actively using twitter as @sscdotopen. Most of the research code that I write is available under an open source license in my github account. Last but not least, I also have a profile in google scholar.

Copy Download

TAGS:Sebastian Schelter

<<< Thank you for your visit >>>

Websites to related :
Satellite Tracking System: Orbit

  Satellite tracking &bull; HAM radio &bull; ISS &bull; Visual observing &bull; Tracking software &bull; Iridium flares &bull; Satellite tracking at hom

Noticias de San Sebastian - Noti

  GRUPO 24 HORASEL PORTALUCOEUROPALATINOAMÉRICACUBApuerto ricoTWITTERfacebookyoutubeflickrrssSan SebastianEmprendiendoViajes, opinión y curiosidadesDe

Sebastian Schlueter

   /*! Squarespace LESS Compiler (less.js language v1.3.3) */.sqs-slide-wrapper[data-slide-type="cover-page"] .text-align-center{text-align:center}

Kiga St. Sebastian &#8211; Nette

  Zum Inhalt springenKiga St. SebastianNettetal-LobberichHomeÜber unsDas sind wirTeamZeitenTagesablaufElternarbeitProjekteSprachförderungSprachtherapi

Web Municipal de San Sebastián

  Página en construcción. Discupe las molestias.

hello@sebastiankoeck.at

  hello@sebastiankoeck.atStartseiteProjects StartseiteProjects hello@sebastiankoeck.at

Barone Gomme di Sebastiano Baron

   X HomeChi siamoServiziCentro di revisioneBarone assistanceAuto di cortesiaConvenzioniContatti

Sebastian Meier | Data-driven In

  Contact Sebastian MeierData-driven Innovation„Creating meaningful user-experiences by combining data-driven approaches with user-centric perspecti

Hoteles Tres Reyes | Pamplona y

   InicioHotel Tres Reyes PamplonaHabitacionesOfertasEventosRestauranteServiciosSituaciónFotosHotel Tres Reyes San SebastiánHabitacionesOfertas

Kate Schelter LLC – Home

  keywords:
description:Kate Schelter is an artist, illustrator, creative director, stylist, and author
Instagra

ads

Hot Websites