The Apache Hop team just released version 2.1.0.
This new release is the result of four and a half...
Another two months after the 2.8.0 release, the Apache Hop community is proud to announce the availability of Apache Hop 2.9.0.
Just like in previous releases, our focus has been on hardening the Apache Hop platform while adding new functionality. This release contains two months of work by 9 contributors (2 of which are new) on over 80 tickets.
Let's walk through what Apache Hop 2.9.0 brings for you.
Apache Hop offers a lot of functionality to read data from files through the CSV Input, Text File Input, JSON Input and other transforms. All of these transforms have a "Get Fields" button to let you read the file layout. Sometimes, however, you know a file's layout in advance, and don't want to scan the first x rows of data to make a (smart) guess.This is where the static schema comes into play.
A Static Schema definition lets you specify a file layout that can be used in a CSV Input, Text File Input and other transforms.
After creating a Static Schema definition in the metadata perspective, you can now use that schema to specify a file layout in one of the supported transforms.
A new Schema Mapping transform lets you map your pipeline stream layout to a static schema definition. Fields not in your pipeline stream but specified in a static schema definition will be added at the right position in your stream with a blank value.
The static schema definition metadata type and schema mapping transform were developed by know.bi in cooperation with our partner Serasoft. The development of the static schema functionality was sponsored by one of our customers who is migrating from Talend to Apache Hop and is building an entire new Apache Hop based data engineering and data integration platform. We'll have more exciting news on that customer case soon.
Another new addition to Apache Hop 2.9.0 is CrateDB. CrateDB is an enterprise database for time series, documents, and vectors.
CrateDB is based on PostgreSQL and works with the PostgreSQL JDBC driver and relational database transforms like Table Input, Table Output, Insert/Update and others.
Since CrateDB is built on top of PostgreSQL and offers additional functionality, Apache Hop 2.9.0 comes with a new CrateDB database dialect and bulk loader transform. The bulk loader transform lets you write data to CrateDB trough the COPY command or the REST endpoint.
Reach out if you want to find out more about Apache Hop, if you'd like to upgrade from PDI/Kettle or Talend, or if you'd like to discuss how we can help you build a successful data platform with Apache Hop.
The Apache Hop team just released version 2.1.0.
This new release is the result of four and a half...
We're two months into what more or less organically has become the bi-monthly release cycle for...
The Apache Hop community released Apache Hop 2.8.0 late last week. This release contains over three...
Blog comments