list/projects [walkthroughs] cs.LG cs.CV eess.AS

Computer Science > Project pages

Projects

Deep walkthroughs of the research lines I lead. Each page tells the full story across the underlying papers, with figures, derivations, and small interactive widgets.

research

NeurIPS 2024 ICASSP 2026 DMLR 2026 ML4H 2023

Representation-Based Data Quality Audits

A learned representation carries traces of the data that trained it, beyond what the labels reveal. We turn those traces into an auditor that flags off-topic samples, near-duplicates, and label errors, with no clean reference data and no per-issue training.

  • data-centric ML
  • self-supervised learning
  • medical imaging
  • audio
read walkthrough

Under review

Auditing a Scientific Field through its Representations

If we know when to trust representation geometry, the same geometry can audit not just a single dataset but an entire scientific archive. We aggregate 1.1 million dermatology images from 29 public datasets into one shared space and quantify the field's demographic skew, novelty plateaus, and structural voids from that geometry.

  • representation learning
  • field-scale audit
  • medical imaging
  • data-centric ML
read walkthrough