Log files exploration with Oracle Big Data Discovery 1.1

Christos - Iraklis TsatsoulisBig Data, Exploratory Data Analysis, Oracle Big Data Discovery 1 Comment

In a previous post, we described how we performed exploratory data analysis (EDA) in real-world log files, as provided by Skroutz.gr, the leading online company in Greece for online price comparison, in the context of Athens Datathon 2015. In the present post we will have a look at the same job as performed with Oracle Big Data Discovery (v. 1.1), …

Using Ansible to configure an Oracle Linux 7.1 server with Oracle 12c R1 Enterprise Edition Database

Chris VezalisAnsible, DEVOPS, Linux, Oracle Database, Oracle Linux, Vagrant

Ansible is the leading tool for configuring software and various parameters on servers. It does not require agents and other software installed on nodes like other popular tools (puppet or chef). Also, it is modular and already has hundreds of modules that help us configure our servers in several ways. In this article I will demonstrate how we can install …

Installing RStudio & additional R packages in Oracle Big Data Lite VM 4.2.1

Christos - Iraklis TsatsoulisR Leave a Comment

I was very happy to find out that, in the latest version (4.2.1) of Oracle Big Data Lite VM, all the R-related issues I had located and reported in the past (see here and here) have been resolved. Nevertheless, some new issues have emerged. Below are my findings and workarounds (if you are in a hurry, feel free to jump …

Athens Datathon 2015: exploratory data analysis for anomaly detection & data quality

Christos - Iraklis TsatsoulisData Science, Exploratory Data Analysis, R 8 Comments

Together with my friend and former colleague Georgios Kaiafas, we formed a team to participate to the Athens Datathon 2015, organized by ThinkBiz on October 3; the datathon took place at the premises of Skroutz.gr, which was also the major sponsor and the data provider. It was the second such event organized in Athens, and you can see the Datathon …

Big Data Discovery configuration in Oracle Big Data Lite VM 4.2.1

Christos - Iraklis TsatsoulisBig Data, Oracle Big Data Discovery 1 Comment

The latest version (4.2.1) of Oracle Big Data Lite VM, among many additions, now includes also the much-expected Oracle Big Data Discovery (v. 1.1), which I had not played with so far (it is a new product); so I thought to take it for a ride. Since my test data included geolocation attributes (latitude/longitude), one of the first things I …