Querying Big Data SQL tables with Oracle R Enterprise

Christos - Iraklis Tsatsoulis | Big Data, Oracle Big Data SQL, Oracle R | 1 Comment

I was wondering recently if I could use Oracle R Enterprise (ORE) to query Big Data SQL tables (i.e. Oracle Database external tables based on HDFS or Hive data), since I have never seen such a combination mentioned in the relevant Oracle documentation and white papers. I am happy to announce that the answer is an unconditional yes. In this …

Manipulating Hive tables with Oracle R connectors for Hadoop

Christos - Iraklis Tsatsoulis | Hadoop, Hive, Oracle R | 2 Comments

In this post, we’ll have a look at how easy it is to manipulate Hive tables using Oracle R connectors for Hadoop (ORCH, presently known as Oracle R Advanced Analytics for Hadoop – ORAAH). We will use the weblog data from Athens Datathon 2015, which we have already loaded in a Hive table named weblogs, as described in more detail …

Log files exploration with Oracle Big Data Discovery 1.1

Christos - Iraklis Tsatsoulis | Big Data, Exploratory Data Analysis, Oracle Big Data Discovery | 1 Comment

In a previous post, we described how we performed exploratory data analysis (EDA) on real-world log files provided by Skroutz.gr, the leading online price comparison company in Greece, in the context of Athens Datathon 2015. In the present post we will have a look at the same job as performed with Oracle Big Data Discovery (v. 1.1), …

Athens Datathon 2015: exploratory data analysis for anomaly detection & data quality

Christos - Iraklis Tsatsoulis | Data Science, Exploratory Data Analysis, R | 8 Comments

Together with my friend and former colleague Georgios Kaiafas, I formed a team to participate in the Athens Datathon 2015, organized by ThinkBiz on October 3; the datathon took place at the premises of Skroutz.gr, which was also the major sponsor and the data provider. It was the second such event organized in Athens, and you can see the Datathon …