Pentaho Community Meeting 2015 - London



The Pentaho Community Meeting of 2015 was held in London on the 7th of November. The conference was held at the W12 Conferences centre in West London.

First Keynote

Bob Kemper the executive VP of Engineering talked about the roadmap of Pentaho for next year. He talked about the take over by Hitachi Group Company and the impact on the Pentaho Software

Second Keynote

Pedro Alves talked about the roadmap for Ctools. They want to simplify Ctools in such a way that new users can easily develop dashboards, via a Dashboard Assembler. The product relies on two main cores: PDI & the Pentaho server. The dashboard Assembler should be available in the user console. The dashboardscan be developed via a drag and drop method. The tool will be designed in such a way that the developer can easily see the result. A few steps need to be taken before they can start to build the Dashboard Assembler, among other things the documentation of Ctools need to be completed.


Pedro Vale presented some new features that are available in Pentaho 6.0.

  • OSGI is now a core component: standard plugin architecture and easier to use dependency management, versioning. Shared plugin architecture – Plugins can be used for PDI and BA server
  • Marketplace has the same look and feel in PDI and on the BA server. They also have nearly the same code base. The ability to install OSGI based plugins is added to the marketplace
  • Dashboard component in CDE
  • Data Lineage in PDI – Gives you the capability of getting information of data


Jens Bleuel presented new features in PDI 6.0 and how to use them. Jens gave a demo on how to use the data lineage on a fact and dimension table. In the future data lineage would be implemented in PDI.

Jens talked about Pentaho data services which are improved in the 6.0 version.

The shim configuration is also improved in the 6.0 version among other improvements such as the removal of the default generate rows transformation when executing a transformation on the carte server.


Hans Van Akelyen presented the dynamic ETL he developed a dynamic map reduce. Via a scenario the user is able to process different files with a different layout. In the scenario you can specify the copy, mapper, reducer and delete transformation. Hans demonstrate the dynamic ETL via a copy of files on hdfs, a mapper and a reducer to result in one file. The transformations of the dynamic map-reduce are meta-injected via the meta-injection step, the fields that are not supported for meta-injection are injected via JavaScript via the API.


JuanjoOrtilles talked about WAQE. Juanjo made a model in de user console and via het WAQE plugin he then queried the model. WAQE is a simple console which enables the user to query the data without any knowledge about the model.


Roland BoumantalkedaboutPhase a Pentaho Analysis editor. Phase is a variety of editors: XML source, Forms, Diagrams. Phase is available via the Pentaho marketplace. Phase is an alternative for Schema Workbench. You are able to design a schema via dragging and dropping the tables, you are also able to alter the xml.

Roland introducedPash, an Pentaho Analysis Shell. Pash is an interactive XML/A query tool written in html/JavaScript/CSS. With Pash you are able to connect to a XML for Analysis, retrieve metadata about the available cubes, dimensions, measures and you can execute MDX queries.

Roland introduced Xavier, an XML for Analysis Visualizer. The features of Xavier are: OLAP explorer, Pluggable, Excel export, User documentation.


Antonio GarciapresentedtheArtifactCatalog, analternative web-based user interface. The sparkle plugin allows placing any resource into one or more sections and these sections can be nested and filtered depending on the role of the user. The plugin list the files on the server. As an administrator you are able to perform quick reviews of the permissions assigned to a particular role or user. Permissions can be toggled as well.


Miguel Cunhalpresented 10 tips forPDI: splitting files based on input, variables in JavaScript, partial sort, steps that support metadata-injection, copy values to all rows, verify transformation, variables ,password encryption and regex.


Tom Barber talked about the Saiku 3 update. He demoed the standalone version of Saiku. The application has now a Dashboard designer embedded where you can drag and drop charts, maps and tables.


SébastianJelsch talked about the possibility to implement Mondrian to execute MDX queries on top of Apache Kylin. Apache Kylin is an Extreme OLAP Engine for Big Data. Inovex, the firm of SébastianJelsch created a Kylin dialect and optimized the Kylins JDBC driver. The dialect is tested on the current Saiku and Mondrian 4.4.


Matt Casters closed the Pentaho Community Meeting with a call to collaborate on PDI. And a cool new feature to have some sample data to test your transformation. More information can be found on his blog.