Catching the "bad guys" using graphs. Figure 1: Gartner layered model for fraud detection

Amazon SageMaker is a "fully managed machine learning service". This means it provisions an environment for data scientists and developers without them needing…

One of the new features in Pentaho Data Integration 8.1 is the ability to directly connect to Google Drive. PDI uses the Virtual File System (VFS) which allows…

On May, 16th 2018, Hitachi Vantara released Pentaho 8.1 Although this is a minor follow-up release to 8.0 as far as version numbers go, but nevertheless a lot…

Automate everything! Analytics projects are often treated as ad-hoc projects. Code and content are often managed in a version control system (git), but often…

What size is this? Suppose you want to predict what the length or width of a flower petal. For this we can look for a relation between the two.

What's weird about this? At certain times you might be faced with unexpected patterns or events appearing in your data. Let's take a look on how we can tackle…

How is this related? In this post, we'll take a look at how we can find out in what way data is structured or related.

Is this A, or B? As a follow-up to last week's machine learning tidbit let's look at an example of how we can solve a classification problem using machine…

What is a graph database? Although graph theory has been around for centuries, graph databases began their rise to popularity relatively recently. A graph…