Asad, Sarah; Liu, Xiaozhen; Park, Kun Woo; Lin, Xinyuan; Bai, Jiadong; Ni, Shengquan; Huang, Yicong; Li, Chen
Click, Share, Learn: Teaching Data Science Using Apache Texera
EDBT (Demo track), 2026.
Existing data science tools, such as Jupyter Notebook, require users to be familiar with coding, making them unsuitable for introductory data-science courses that need concrete ways to illustrate core concepts, such as tables, schemas, transformations, and ML models, before students are comfortable with programming. Other education-related challenges, like collaboration and computing resource requirements, further complicate this problem. This demonstration shows how Texera, an open-source, browser-based visual workflow system supporting collaborative data science and AI/ML, is designed to overcome these challenges in the classroom setting and presents a scenario in which two students use Texera to learn data science by constructing and executing a data-science pipeline on a real dataset. This demonstration further highlights how Texera’s stepwise, visual execution can be aligned with explicit learning objectives to support hands-on teaching of data-science concepts in the classroom.