Log files exploration with Oracle Big Data Discovery 1.1

Christos - Iraklis Tsatsoulis | Big Data, Exploratory Data Analysis, Oracle Big Data Discovery | 1 Comment

In a previous post, we described how we performed exploratory data analysis (EDA) on real-world log files provided by Skroutz.gr, the leading online price comparison company in Greece, in the context of Athens Datathon 2015. In the present post we will have a look at the same job as performed with Oracle Big Data Discovery (v. 1.1), …

Installing RStudio & additional R packages in Oracle Big Data Lite VM 4.2.1

Christos - Iraklis Tsatsoulis | R

I was very happy to find out that, in the latest version (4.2.1) of Oracle Big Data Lite VM, all the R-related issues I had located and reported in the past (see here and here) have been resolved. Nevertheless, some new issues have emerged. Below are my findings and workarounds (if you are in a hurry, feel free to jump …

Big Data Discovery configuration in Oracle Big Data Lite VM 4.2.1

Christos - Iraklis Tsatsoulis | Big Data, Oracle Big Data Discovery | 1 Comment

The latest version (4.2.1) of Oracle Big Data Lite VM, among many additions, now also includes the much-anticipated Oracle Big Data Discovery (v. 1.1), a new product that I had not yet tried, so I decided to take it for a spin. Since my test data included geolocation attributes (latitude/longitude), one of the first things I …

Installing rJava R package in Oracle Linux

Christos - Iraklis Tsatsoulis | R | 1 Comment

The Oracle Big Data Lite (BDLite) VM is a convenient platform for testing, development, and training on the related tools and technologies, such as Cloudera Hadoop, Oracle NoSQL database, Oracle SQL Developer & Data Modeler, etc. Among other things, it includes a full distribution of Oracle R Enterprise (ORE) and the Oracle R Connectors for Hadoop (ORCH). …

Oracle R Enterprise issues in Oracle Big Data Lite VM 4.1.0

Christos - Iraklis Tsatsoulis | Oracle R | 4 Comments

In the previous post, we examined some configuration issues with Cloudera Manager and Hadoop services in the latest release of Oracle Big Data Lite VM (4.1.0). In this post we report issues with Oracle R Enterprise, and the remedies we applied. It turns out that if we load the ORE package in R, we subsequently cannot use the help system …

Cloudera Manager configuration issues in Oracle Big Data Lite VM 4.1.0

Christos - Iraklis Tsatsoulis | Big Data, Hadoop | 2 Comments

Oracle has recently announced the release of a new version (4.1.0) of its Big Data Lite VM. Compared to the previous release (4.0.1), we now have more recent versions of Oracle Enterprise Linux (6.5), Oracle NoSQL database (3.2.5), Cloudera distribution of Apache Hadoop (CDH 5.3.0) and Cloudera Manager (5.3.0). The new version of CDH, by itself, also brings forward several …