about me education research industry teaching projects publications resume

Eliot Seo Shekhtman.

ABOUT ME

headshot

I am a Ph.D. student in Computer and Information Science at the University of Pennsylvania, where I am advised by Professor Nikolai Matni. My research interests are in the intersection of machine learning, control theory, and learning theory for dynamic systems, with the goal of studying fairness in deployed systems.

I additionally have a CS MS from the College of Computing and Information Science at Cornell University with a minor in Linguistics, where I was advised by Sarah Dean. Previously, I worked as a contingent machine learning engineer for Goldman Sachs in the Compliance Engineering division.

I can be reached through email at shekhe at seas dot upenn dot edu, and I have public Google Scholar and LinkedIn profiles.

In my free time, I like cooking!

EDUCATION

UPenn logoPhD @ University of Pennsylvania

Penn Engineering | Aug 2024-present | Philadelphia, PA

Ph.D. in Computer and Information Science

Advisor: Nikolai Matni

  • Learning for Dynamics and Control
  • Learning with Conditional Guarantees
  • Theory of Machine Learning
  • Modern Convex Optimization
  • Learning in Games and Games in Learning

Cornell logoMS @ Cornell University

Bowers CIS | Aug 2022-May 2024 | Ithaca, NY | [ thesis ][ slides ]

Masters of Science in Computer Science with a minor in Linguistics

Advisor: Sarah Dean

  • Cornell Aguaclara (graduate technical advisor)
  • Nexus Scholar Program (graduate consultant)
  • Machine Learning Theory
  • Machine Learning in Feedback Systems
  • Deep Generative Models
  • Computer Vision
  • Honors Introduction to Analysis
  • Game Theory I

Cornell logoBS @ Cornell University

College of Engineering | Aug 2018-May 2022 | Ithaca, NY

Bachelors of Science in Computer Science with a minor in Linguistics

Magna Cum Laude with Honors

  • Cornell University Sustainable Design (algorithms and predictions team lead)
  • Cornell Undergraduates in Linguistics (social chair)
  • Advanced Topics in Machine Learning *
  • Learning with Big Messy Data *
  • The Structure of Information Networks *
  • Principles of Large-Scale Machine Learning
  • Advanced Language Technologies *
  • Natural Language Processing
  • Introduction to Computer Vision
  • Introduction to Machine Learning
  • Introduction to Analysis of Algorithms

* Graduate level courses

BCHS logoBethlehem Central High School

Sep 2013-Jun 2018 | Delmar, NY

  • Science Bowl (co-captain)
  • Linguistics Club (president)
  • Science Olympiad
  • Masterminds
  • Varsity Cross-Country
  • Varsity Track
  • Varsity Swim and Dive
  • AP Computer Science A
  • Introduction to Computer Science
  • Digital Electronics

RESEARCH

UPenn logoNikolai Matni Lab

Aug 2024-present | Philadelphia, PA

Research direction: system identification of dynamic systems

Cornell logoSarah Dean Lab

Jan 2023-May 2024 | Ithaca, NY

Research direction: learning theory for dynamic systems

Cornell logoKilian Weinberger Lab

Sept 2022-Apr 2023 | Ithaca, NY

Research direction: natural language processing

Cornell logoUdell Group

Advisor: Madeleine Udell | Jun 2020-Jul 2022 | Ithaca, NY

Research direction: unsupervised learning

Cornell logoLife History Lab

Advisor: Marlen Gonzalez | Jun 2019-Dec 2021 | Ithaca, NY

Research direction: computational neuroscience

SUNY Albany logoSUNY Albany Forensic Chemistry Lab

Advisor: Igor Lednev | Jun 2016-Aug 2017 | Albany, NY

Research direction: computational biology

INDUSTRY EXPERIENCE

Goldman Sachs logoGoldman Sachs | Contingent Worker

Role: Machine Learning Engineer @ Compliance Engineering

Sept 2022-Mar 2025 | Remote

Goldman Sachs logoGoldman Sachs | Internship

Role: Engineering Summer Analyst @ Compliance Engineering

Jun 2022-Aug 2022 | New York City, NY

Goldman Sachs logoGoldman Sachs | Internship

Role: Engineering Summer Analyst @ Compliance Engineering

Jun 2021-Aug 2021 | New York City, NY

TEACHING EXPERIENCE

Cornell logoCS 4670 Introduction to Computer Vision

Role: Graduate Teaching Assistant

Spring 2024 | Ithaca, NY

Cornell logoCS 6785 Applied Machine Learning

Role: Head Teaching Assistant

Fall 2023 | New York City, NY

Cornell logoCS 4780 Introduction to Machine Learning

Role: (Graduate) Teaching Assistant

Spr 2021-Spr 2023 | Ithaca, NY

Cornell logoCS 3110 Data Structures and Functional Programming

Role: Teaching Assistant

Fall 2020 | Ithaca, NY

UCode logoUCode Programming Academy

Role: Instructor

Jun 2019-Sep 2020 | Ithaca, NY

BCHS logoBethlehem Central High School

Role: Tutor

Sep 2017-May 2018 | Delmar, NY

PROJECTS

Cornell logoNexus Scholars Program Consulting

May 2022-Jan 2023 | Ithaca, NY

CUSD logoCornell University Sustainable Design (CUSD)

Feb 2020-Jun 2022 | Ithaca, NY

Cornell logoModeling Complex Contagion on Clique Based Networks

Feb 2021 | Ithaca, NY

PUBLICATIONS/PREPRINTS

UPenn logoNearly Instance-Optimal Parameter Recovery from Many Trajectories via Hellinger Localization (arXiv preprint)

Authors: Eliot Shekhtman*, Yichen Zhou*, Ingvar Ziemann, Nikolai Matni, and Stephen Tu

"A framework for bounding MLE parameter recovery rates for broad classes of multi-trajectory sequential learning settings."

Cornell logoStrategic Usage in a Multi-Learner Setting (AISTATS 2024)

Authors: Eliot Shekhtman, and Sarah Dean

[ poster ]

"What happens when adversarial agents switch between services to thwart the latter's filtration and reward maximization efforts?"

Cornell logoStrategic Usage in a Multi-Learner Setting (EcoSys 2024 @ AAAI)

Authors: Eliot Shekhtman, and Sarah Dean

[ poster ][ oral presentation slides ][ lighting talk ]

Real-world systems often involve some pool of users choosing between a set of services. Extensive prior research has been conducted on the effects of strategic users in single-service settings, with strategic behavior manifesting in the manipulation of observable features to achieve a desired classification; however, this can often be costly or unattainable for users and fails to capture the full behavior of multi-service dynamic systems. We analyze a setting in which strategic users choose among several available services in order to pursue positive classifications, while services seek to minimize loss functions on their observations. We show that naive retraining can lead to oscillation even if all users are observed at different times; however, we show necessary and sufficient conditions to guarantee convergent behavior if this retraining uses memory. We provide results obtained from synthetic and real-world data to empirically validate our theoretical findings.

Cornell logoLatent Diffusion for Language Generation (NeurIPS 2023)

Authors: Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, and Kilian Weinberger

"Fitting a diffusion model on the latent space of a pretrained language model enables sample-efficient unconditional, class-conditional, and sequence-to-sequence language generation, improving on prior efforts to use diffusion for language."

  • Implemented loss masking and normalization for training the latent diffusion model on BART encoder latents.
  • Implemented text sampling using a BART decoder.
  • Ran baseline experiments using GPT-2.

Cornell logoOnline Missing Value Imputation and Correlation Change Detection for Mixed-type Data via Gaussian Copula (AAAI 2022)

Authors: Yuxuan Zhao, Eric Landgrebe, Eliot Shekhtman, and Madeline Udell

"A novel online algorithm for scalable and flexible missing value imputation using the Gaussian copula."

  • Implemented the online expectation-maximization algorithm.
  • Ran experiments on performance in the minibatch and online settings.