Publications

Jump to full-list, patents and thesis

Highlights

A Nutritional Label for Rankings

A Web-based application that generates a “nutritional label” for rankings. Ranking Facts is made up of a collection of visual widgets that implement our latest research results on fairness, stability, and transparency for rankings, and that communicate details of the ranking methodology, or of the output, to the end user.

Ke Yang, Julia Stoyanovich, Abolfazl Asudeh, Bill Howe, HV Jagadish and Gerome Miklau

SIGMOD (demo), 2018

DataSynthesizer: Privacy-preserving synthetic datasets

To facilitate collaboration over sensitive data, we present DataSynthesizer, a tool that takes a sensitive dataset as input and generates a structurally and statistically similar synthetic dataset with strong privacy guarantees.

Haoyue Ping, Julia Stoyanovich and Bill Howe

SSDBM, 2017

Fides: A platform for responsible data science

We see a need for a data sharing and collaborative analytics platform with features to encourage (and in some cases, enforce) best practices at all stages of the data science lifecycle. We propose Fides, in the context of urban analytics, outlining a systems research agenda in responsible data science.

Bill Howe, Julia Stoyanovichi, Serge Abiteboul, Gerome Miklau, Arnaud Sahuguet and Gerhard Weikum

SSDBM, 2017

 

Full List

Teaching Responsible Data Science: Charting New Pedagogical Territory
Julia Stoyanovich and Armanda Lewis
International Journal of Artificial Intelligence in Education (IJAIED), 2021

Causal Intersectionality and Fair Ranking
Ke Yang, Joshua R. Loftus, and Julia Stoyanovich
Proceedings of FORC 2021

Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines
Stefan Grafberger, Julia Stoyanovich, and Sebastian Schelter
Proceedings of CIDR 2021

Taming Technical Bias in Machine Learning Pipelines
Sebastian Schelter and Julia Stoyanovich
IEEE Data Engineering Bulletin 43(4): 2020

Fairness in Ranking: A Survey
Meike Zehlike, Ke Yang, and Julia Stoyanovich
arXiv

Impact Remediation: Optimal Interventions to Reduce Inequality
Lucius E. J. Bynum, Joshua R. Loftus, and Julia Stoyanovich
arXiv

Fairness as Equality of Opportunity: Normative Guidance from Political Philosophy
Falaah Arif Khan, Eleni Manis, and Julia Stoyanovich
arXiv

Fairness and Friends
Falaah Arif Khan, Eleni Manis, and Julia Stoyanovich
ACM FAccT (2021), tutorial slides

Responsible Data Management
Julia Stoyanovich, Bill Howe, and H.V. Jagadish
PVLDB 13(12): 3474-3489 (2020), invited paper accompanying VLDB 2020 keynote presentation

The Imperative of Interpretable Machines
Julia Stoyanovich, Jay J. Van Bavel, and Tessa V. West
Nature Machine Intelligence, April 2020

Zooming out on an Evolving Graph
Amir Aghasadeghi, Vera Z. Moffitt, Sebastian Schelter, and Julia Stoyanovich
Proceedings of EDBT 2020

Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning
Ke Yang, Biao Huang, Julia Stoyanovich, and Sebastian Schelter
Proceedings of HILDA 2020 (an ACM SIGMOD workshop)

FairPrep: Promoting Data to a First-Class Citizen in Studies on Fairness-Enhancing Interventions
Sebastian Schelter, Yuxuan He, Jatin Khilnani, and Julia Stoyanovich

Balanced Ranking with Diversity Constraints
Ke Yang, Vasilis Gkatzelis, and Julia Stoyanovich
Proceedings of IJCAI 2019

Designing Fair Ranking Schemes
Abolfazl Asudeh, H. V. Jagadish, Julia Stoyanovich, and Gautam Das
Proceedings of ACM SIGMOD, 2019

MithraRanking: A System for Responsible Ranking Design (demonstration)
Yifan Guan, Abolfazl Asudeh, Pranav Mayuram, H. V. Jagadish, Julia Stoyanovich, Gerome Miklau, and Gautam Das
Proceedings of ACM SIGMOD, 2019

Transparency, Fairness, Data Protection, Neutrality: Data Management Challenges in the Face of New Regulation
Serge Abiteboul and Julia Stoyanovich
ACM Journal of Data and Information Quality, 2019

Nutritional Labels for Data and Models
Julia Stoyanovich and Bill Howe
IEEE Data Engineering Bulletin 42(3): 13-23 (2019)

Towards Responsible Data-driven Decision Making in Score-Based Systems
Abolfazl Asudeh, H. V. Jagadish, and Julia Stoyanovich
IEEE Data Engineering Bulletin 42(3): 76-87 (2019)

TransFAT: Translating Fairness, Accountably and Transparency into Data Science Practice
Julia Stoyanovich
International Workshop on Processing Information Ethically (PIE@CAiSE) (2019)

On Obtaining Stable Rankings
Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, and Julia Stoyanovich
PVLDB 12(3): 237-250 (2018)

Panel: A Debate on Data and Algorithmic Ethics
Julia Stoyanovich, Bill Howe, H. V. Jagadish, Gerome Miklau
PVLDB 11(12): 2165-2167 (2018)

Research Directions for Principles of Data Management (Dagstuhl Perspectives Workshop 16151)
Serge Abiteboul, Marcelo Arenas, Pablo Barceló, Meghyn Bienvenu, Diego Calvanese, Claire David, Richard Hull, Eyke Hüllermeier, Benny Kimelfeld, Leonid Libkin, Wim Martens, Tova Milo, Filip Murlak, Frank Neven, Magdalena Ortiz, Thomas Schwentick, Julia Stoyanovich, Jianwen Su, Dan Suciu, Victor Vianu, Ke Yi
Dagstuhl Manifestos 7(1): 1-29 (2018)

A Technical Research Agenda in Data Ethics and Responsible Data Management
Julia Stoyanovich, Bill Howe, and HV Jagadish
SIGMOD, 2018

A Query Engine for Probabilistic Preferences
Uzi Cohen, Batya Kenig, Haoyue Ping, Benny Kimelfeld, and Julia Stoyanovich
SIGMOD, 2018

A Nutritional Label for Rankings
Ke Yang, Julia Stoyanovich, Abolfazl Asudeh, Bill Howe, HV Jagadish and Gerome Miklau
SIGMOD (demo), 2018

Online Set Selection with Fairness and Diversity Constraints
Julia Stoyanovich, Ke Yang and HV Jagadish
EDBT, 2018

Computational Social Choice Meets Databases
Benny Kimelfeld, Phokion Kolaitis and Julia Stoyanovich
IJCAI, 2018

Probabilistic inference over repeated insertion models
Batya Kenig, Lovro Ilijasic, Haoyue Ping, Benny Kimelfeld, and Julia Stoyanovich
AAAI, 2018

Generating Evolving Property Graphs with Attribute-Aware Preferential Attachment
Amir Aghasadeghi and Julia Stoyanovich
DBTest, 2018

MobilityMirror: Bias-Adjusted Synthetic Transportation Datasets
Luke Rodriguez, Babak Salimi, Haoyue Ping, Julia Stoyanovich and Bill Howe
BiDU, 2018

Portal: A Query Language for Evolving Graphs
Vera Zaychik Moffitt and Julia Stoyanovich
SIGMOD, 2017

Querying probabilistic preferences in databases
Batya Kenig, Benny Kimelfeld, Haoyue Ping and Julia Stoyanovich
PODS, 2017

Towards sequenced semantics for evolving graphs
Vera Z. Moffitt and Julia Stoyanovich
EDBT, 2017

DataSynthesizer: Privacy-preserving synthetic datasets
Haoyue Ping, Julia Stoyanovich and Bill Howe
SSDBM, 2017

Fides: A platform for responsible data science
Bill Howe, Julia Stoyanovichi, Serge Abiteboul, Gerome Miklau, Arnaud Sahuguet and Gerhard Weikum
SSDBM, 2017

Diversity in Big Data: A Review
Marina Drosou, HV Jagadish, Evaggelia Pitoura and Julia Stoyanovich
Big Data Special Issue on Social and Technical Trade-Offs, June 2017

Temporal Graph Algebra
Vera Z. Moffitt and Julia Stoyanovich
DBPL, 2017

Synthetic data for social good
Bill Howe, Julia Stoyanovich, Haoyue Ping, Bernease Herman, and Matt Gee
Proceedings of Data for Good Exchange (D4GX), 2017

Zooming in on NYC taxi data with Portal
Julia Stoyanovich, Matthew Gilbride and Vera Z. Moffitt
Proceedings of Data Science for Social Good (DSSG), 2017

Measuring fairness in ranked outputs
Ke Yang and Julia Stoyanovich
SSDBM, 2017

A database framework for probabilistic preferences
Batya Kenig, Benny Kimelfeld, Haoyue Ping and Julia Stoyanovich
AMW, 2017

Workload-driven learning of Mallows mixtures with pairwise preference data
Julia Stoyanovich, Lovro Ilijasic and Haoyue Ping
WebDB, 2016, co-located with ACM SIGMOD 2016

Data, responsibly: fairness, neutrality and transparency in data analysis
Julia Stoyanovich, Serge Abiteboul and Gerome Miklau
EDBT, 2016

Data, responsibly (Dagstuhl Seminar 16291)
Serge Abiteboul, Gerome Miklau, Julia Stoyanovich and Gerhard Weikum
Schloss Dagstuhl Seminar Report, 2016

Towards a distributed infrastructure for evolving graph analytics
Vera Zaychik Moffitt and Julia Stoyanovich
TempWeb 2016, co-located with WWW 2016

Collaborative Access Control in WebdamLog
Vera Zaychik Moffitt, Julia Stoyanovich, Serge Abiteboul, and Gerome Miklau
SIGMOD 2015

Analyzing Crowd Rankings
Julia Stoyanovich, Marie Jacob, Xuemei Gong
WebDB 2015

A System for Management and Analysis of Preference Data
Marie Jacob, Benny Kimelfeld, Julia Stoyanovich
PVLDB 7(12), 2014

Search and Result Presentation in Scientific Workflow Repositories
Susan Davidson, Xiaocheng Huang, Julia Stoyanovich, Xiaojie Yuan
SSDBM, 2013

Learning to Explore Scientific Workflow Repositories (short paper)
Julia Stoyanovich, Paramveer Dhillon, Brian Lyons, Susan Davidson
SSDBM, 2013

Learning Feature Weights from Positive Cases
Sid Gunawardena, Rosina Weber, Julia Stoyanovich
ICCBR, 2013

Understanding Local Structure in Ranked Datasets (vision paper)
Julia Stoyanovich, Sihem Amer-Yahia, Susan Davidson, Marie Jacob, Tova Milo
CIDR, 2013

Rule-Based Application Development using Webdamlog (demonstration)
Serge Abiteboul, Emilien Antoine, Gerome Miklau, Julia Stoyanovich, and Jules Testard
SIGMOD, 2013

Introducing Access Control in Webdamlog (invited talk by Serge Abiteboul)
Serge Abiteboul, Emilien Antoine, Gerome Miklau, Julia Stoyanovich, and Vera Zaychik Moffitt
DBPL, 2013

Viewing the Web as a Distributed Knowledge Base
Serge Abiteboul, Emilien Antoine and Julia Stoyanovich
ICDE, 2012

Putting Lipstick on Pig: Enabling Database-style Workflow Provenance
Yael Amsterdamer, Susan Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, and Val Tannen
PVLDB 5(4), 2011

Deriving Probabilistic Databases with Inference Ensembles
Julia Stoyanovich, Susan Davidson, Tova Milo, and Val Tannen
ICDE, 2011

Keyword Search in Workflow Repositories with Access Control
Susan Davidson, Soohyun Lee and Julia Stoyanovich
AMW, 2011

Making Interval-Based Clustering Rank-Aware
Julia Stoyanovich, Sihem Amer-Yahia and Tova Milo
EDBT, 2011

On Provenance and Privacy
Susan Davidson, Sanjeev Khanna, Sudeepa Roy, Julia Stoyanovich, Val Tannen, and Yi Chen
ICDT, 2011

Enabling Privacy in Provenance-Aware Workflow Systems
Susan Davidson, Sanjeev Khanna, Sudeepa Roy, Julia Stoyanovich, Val Tannen, Yi Chen, Tova Milo
CIDR, 2011

AnnotCompute: annotation-based exploration and meta-analysis of genomics experiments
Jie Zheng, Julia Stoyanovich, Elisabetta Manduchi, Junmin Liu, Christian J. Stoeckert, Jr
The Journal of Biological Databases and Curation (Database), 2011

SkylineSearch: Semantic Ranking and Result Visualization for PubMed (demonstration)
Julia Stoyanovich, Mayur Lodha, William Mee and Kenneth A. Ross
SIGMOD, 2011

Semantic Ranking and Result Visualization for Life Sciences Publications
Julia Stoyanovich, William Mee and Kenneth A. Ross
ICDE, 2010

Exploring Repositories of Scientific Workflows
Julia Stoyanovich, Ben Taskar, and Susan Davidson
WANDS, 2010

Rank-Aware Clustering for Structured Datasets
Julia Stoyanovich and Sihem Amer-Yahia
CIKM, 2009

Rank-Aware Clustering for Structured Datasets
Julia Stoyanovich and Sihem Amer-Yahia
Columbia University Technical Report cucs-043-089, 2009

Efficient Network-Aware Search in Collaborative Tagging Sites
Sihem Amer-Yahia, Michael Benedikt, Laks Lakshmanan and Julia Stoyanovich
VLDB, 2008

From del.icio.us to x.qui.site: Recommendations in Social Tagging Sites (demonstration)
Sihem Amer-Yahia, Alban Galland, Julia Stoyanovich, Cong Yu
SIGMOD, 2008

Schema Polynomials and Applications
Kenneth A. Ross and Julia Stoyanovich
EDBT, 2008

Leveraging Tagging to Model User Interests in del.icio.us
Julia Stoyanovich, Sihem Amer-Yahia, Cameron Marlow, Cong Yu
AAAI Spring Symposium on Social Information Processing (AAAI-SIP 2008)

MutaGeneSys: Making Diagnostic Predictions Based on Genome-Wide Genotype Data in Association Studies
Julia Stoyanovich and Itsik Pe’er
Bioinformatics 2008

ReoptSMART: A Learning Query Plan Cache
Julia Stoyanovich, Kenneth A. Ross, Jun Rao, Wei Fan, Volker Markl, Guy Lohman
Columbia University Technical Report cucs-023-08, 2008.

EntityAuthority: Semantically Enriched Graph-Based Authority Propagation
Julia Stoyanovich, Srikanta Bedathur, Klaus Berberich, Gerhard Weikum
WebDB, 2007

A Faceted Query Engine Applied to Archaeology
Kenneth A. Ross, Angel Janevski, Julia Stoyanovich
Internet Archaeology 21, April 2007

A Faceted Query Engine Applied to Archaeology (demonstration)
Kenneth A. Ross, Angel Janevski, Julia Stoyanovich
VLDB, 2005

Symmetric Relations and Cardinality Bounded Multisets in Database Systems
Kenneth A. Ross and Julia Stoyanovich
VLDB, 2004

Patents

  • “Automatically and Adaptively Determining Execution Plans for Queries with Parameter Markers”, Wei Fan, Guy Lohman, Volker Markl, Nimrod Megiddo, Jun Rao, David Simmen, Julia Stoyanovich. United States Patent: 7,958,113 (June 7, 2011); Assignee: IBM.
  • “Social Behavior Analysis and Inferring Social Networks for a Recommendation System”, Sihem Amer-Yahia, Evgeniy Gabrilovich, Bo Pang, Julia Stoyanovich, Cong Yu. United States Patent: 8,073,794 (December 6, 2011); Assignee: Yahoo! Inc.

Thesis

“Search and Ranking in Semantically Rich Applications”, Columbia University PhD Thesis, 10/2009, pdf