Computer Science
2025
Pasta: A Cost-Based Optimizer for Generating Pipelining Schedules for Dataflow DAGs Proceedings Article
In: Proceedings of the ACM SIGMOD International Conference on Management of Data, 2025, (To appear).
IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems Journal Article
In: Proc. VLDB Endow., 2025, (To appear).
2024
Texera: A System for Collaborative and Interactive Data Analytics Using Workflows Journal Article
In: Proc. VLDB Endow. Scalable Data Science track, vol. 17, no. 11, pp. 3580–3588, 2024.
Demonstration of Udon: Line-by-line Debugging of User-Defined Functions in Data Workflows Honorable Mention Proceedings Article
In: Barceló, Pablo; Sánchez-Pi, Nayat; Meliou, Alexandra; Sudarshan, S. (Ed.): Companion of the 2024 International Conference on Management of Data, SIGMOD/PODS 2024, Santiago AA, Chile, June 9-15, 2024, pp. 476–479, ACM, 2024, (Best Demo Runner-Up Award).
40th International Conference on Data Engineering, ICDE 2024 - Workshops, Utrecht, Netherlands, May 13-16, 2024, IEEE, 2024, (*The first two authors share equal contributions).
2023
Using Texera to Characterize Climate Change Discussions on Twitter During Wildfires Presentation
Data Science Day at KDD 2023, 22.08.2023.
Building a Collaborative Data Analytics System: Opportunities and Challenges Workshop
vol. 16, no. 12, 2023.
Texera: A System for Collaborative and Interactive Data Analytics Using Workflows PhD Thesis
University of California, Irvine, USA, 2023.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2023, Seattle, WA, USA, 18 June 2023, ACM, 2023.
University of California, Irvine, USA, 2023.
Udon: Efficient Debugging of User-Defined Functions in Big Data Systems with Line-by-Line Control Journal Article
In: Proc. ACM SIGMOD, vol. 1, no. 4, pp. 225:1–225:26, 2023.
2022
Reshape: Adaptive Result-aware Skew Handling for Exploratory Analysis on Big Data Journal Article
In: CoRR, vol. abs/2208.13143, 2022.
Demonstration of Accelerating Machine Learning Inference Queries with Correlative Proxy Models Journal Article
In: Proc. VLDB Endow., vol. 15, no. 12, pp. 3734–3737, 2022.
Optimizing Machine Learning Inference Queries with Correlative Proxy Models Journal Article
In: Proc. VLDB Endow., vol. 15, no. 10, pp. 2032–2044, 2022.
Demonstration of Collaborative and Interactive Workflow-Based Data Analytics in Texera Journal Article
In: Proc. VLDB Endow., vol. 15, no. 12, pp. 3738–3741, 2022.
Towards Interactive, Adaptive and Result-aware Big Data Analytics PhD Thesis
University of California, Irvine, USA, 2022.
Fries: Fast and Consistent Runtime Reconfiguration in Dataflow Systems with Transactional Guarantees Journal Article
In: Proc. VLDB Endow., vol. 16, no. 2, pp. 256–268, 2022.
Drove: Tracking Execution Results of Workflows on Large Datasets Workshop
PhD Workshop at VLDB 2022, 2022.
2020
Demonstration of Interactive Runtime Debugging of Distributed Dataflows in Texera Journal Article
In: Proc. VLDB Endow., vol. 13, no. 12, pp. 2953–2956, 2020.
Amber: A Debuggable Dataflow System Based on the Actor Model Journal Article
In: Proc. VLDB Endow., vol. 13, no. 5, pp. 740–753, 2020.
2017
A Demonstration of TextDB: Declarative and Scalable Text Analytics on Large Data Sets Best Paper Proceedings Article
In: 33rd IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, USA, April 19-22, 2017, pp. 1403–1404, IEEE Computer Society, 2017, (TextDB was later renamed to Texera in 2018).
Interdisciplinary
2025
DS4ALL: Teaching High-School Students Data Science and AI/ML Using the Texera Workflow Platform as a Service Proceedings Article
In: Data Science Education K-12: Research to Practice Annual Conference, Data Science for Everyone Initiative San Antonio, Texas, 2025.
2024
CloudMapper: Accelerating Single-Cell RNA Sequence Alignment with a Scalable and User-Friendly Cloud-Based Platform Presentation
San Diego Convention Center, San Diego, CA, USA, 16.12.2024.
Brain image data processing using collaborative data workflows on Texera Journal Article
In: Frontiers in Neural Circuits, vol. 18, pp. 1398884, 2024.
How the experience of California wildfires shape Twitter climate change framings Journal Article
In: Climatic Change, vol. 177, no. 1, pp. 1–21, 2024.
Wording Matters: the Effect of Linguistic Characteristics and Political Ideology on Resharing of COVID-19 Vaccine Tweets Journal Article
In: Transactions on Computer-Human Interaction (TOCHI), 2024.
2023
The marketing and perceptions of non-tobacco blunt wraps on Twitter Journal Article
In: Substance Use and Misuse, 2023.
Understanding underlying moral values and language use of COVID-19 vaccine attitudes on twitter Journal Article
In: PNAS nexus, vol. 2, no. 3, pp. pgad013, 2023.
2022
Public Opinions toward COVID-19 Vaccine Mandates: A Machine Learning-based Analysis of U.S. Tweets Proceedings Article
In: AMIA 2022, American Medical Informatics Association Annual Symposium, Washington, DC, USA, November 5-9, 2022, AMIA, 2022.
2021
The social amplification and attenuation of COVID-19 risk perception shaping mask wearing behavior: a longitudinal twitter analysis Journal Article
In: PloS one, vol. 16, no. 9, pp. e0257428, 2021.
Why do people oppose mask wearing? A comprehensive analysis of U.S. tweets during the COVID-19 pandemic Journal Article
In: J. Am. Medical Informatics Assoc., vol. 28, no. 7, pp. 1564–1573, 2021.