Texera Official Release (non-ASF) v1.0.0

Announcing Texera v1.0.0 (Non-ASF Release)

Since 2016, our team has been developing the Texera system to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!

The release is available on GitHub: apache/texera v1.0.0.

Major Features

  • Supporting low/no coding data science using workflows
  • Parallel data-processing engine running on computing clusters
  • Using the Apache Pekko actor-model system
  • Supporting UDFs in Python, R, and Java
  • Supporting ML training and inference
  • Including a rich collection of ML operators
  • Interactive workflow execution model that supports pausing and resuming
  • Supporting collaborations with shared editing, shared execution, and version control
  • Supporting debugging, including line-by-line debugging in Python UDFs
  • Supporting reproducibility of data analysis
  • Region-by-region execution with full pipelining in each region
  • Storing execution results using Apache Iceberg
  • Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
  • Adopting a microservice-based architecture using Kubernetes and Docker
  • Supporting computing isolation and storage isolation of multiple tenants

For Users and Developers

  • For users who want to install Texera on a single machine using Docker, follow the instructions on this page.
  • For developers who want to contribute to Texera, follow the instructions on this page.

Assets

Leave a Reply

Your email address will not be published. Required fields are marked *