Flow through data pipelines with Flowman
Flowman is a powerful and open source data build tool powered by Apache Spark that follows a declarative approach to simplify the act of writing complex ETL, ELT and data transformation applications. The strong focus on transformation and schema management reduces your development efforts for creating robust data pipelines.
Being built on top of Apache Spark, Flowman can be run as a standalone application but can also scale by using compute clusters (Hadoop & Kubernetes) to process any amounts of data.
Focus on business logic instead of Spark boilerplate code!