Enabling the Green-Marl compiler for Parallel Graph Analytics in Oracle Big Data Lite VM

Panagiotis Konstantinidis Oracle Big Data Spatial & Graph 0 Comments

Recently, I began working with Parallel Graph Analytics (PGX) on my Oracle Big Data Lite (BDL) VM version 4.7.0.1. I was especially intrigued and curious about the capabilities of a PGX component called Green-Marl (GM), a domain-specific language specially designed for graph data analysis. It was stated to extend PGX’s capabilities and “implement algorithms with no limit”. Especially the last argument …

Streaming data from Raspberry Pi to Oracle NoSQL via Node-RED

Christos - Iraklis Tsatsoulis Internet of Things, Node-RED, Oracle NoSQL, Raspberry Pi 0 Comments

Starting from version 4.2, Oracle NoSQL now offers drivers for Node.js and Python, in addition to the existing ones for Java, C, and C++; this is good news for data science people, like myself, since we are normally not accustomed to code in Java or C/C++. So, I thought to build a short demo project, putting into test both the …

Installing the additional R packages in Oracle Big Data Lite VM 4.5.0

Christos - Iraklis Tsatsoulis R 2 Comments

Oracle has just released version 4.5.0 of the Big Data Lite VM which, when it comes to R, still suffers from the issues we had pinpointed for the previous version 4.4.0 (and then some). The first attempt to install the additional packages fails with a ‘cannot open URL’ error: Fortunately, the warning about the proxy helps to locate the issue, …

Bulk load data to HBase in Oracle Big Data Appliance

Christos - Iraklis Tsatsoulis Big Data, HBase 0 Comments

I ran into an issue recently, while trying to bulk load some data to HBase in Oracle Big Data Appliance. Following is a reproducible description and solution using the current version of Oracle Big Data Lite VM (4.4.0). Enabling HBase in Oracle Big Data Lite VM (Feel free to skip this section if you do not use Oracle Big Data …

Installing the additional R packages in Oracle Big Data Lite VM 4.4.0

Christos - Iraklis Tsatsoulis R 0 Comments

In the just-released version 4.4.0 of Oracle Big Data Lite VM, as in the previous one (4.3.0.1), there is a rather large number of additional R packages to be installed by the provided script install_additional_packages.sh, i.e. 28 packages without counting their dependencies (the respective number in version 4.2.1 was only 10). Unfortunately, what has also changed is the form of …

Querying Big Data SQL tables with Oracle R Enterprise

Christos - Iraklis Tsatsoulis Big Data, Oracle Big Data SQL, Oracle R 0 Comments

I was wondering recently if I could use Oracle R Enterprise (ORE) to query Big Data SQL tables (i.e. Oracle Database external tables based on HDFS or Hive data), since I have never seen such a combination mentioned in the relevant Oracle documentation and white papers. I am happy to announce that the answer is an unconditional yes. In this …

Oracle R Enterprise 1.4: ore.make.names does not work for Oracle DB connections

Christos - Iraklis Tsatsoulis Oracle R 0 Comments

I have reported in the past about some unexpected behavior issues of Oracle R Enterprise 1.4 ore.make.names function; nevertheless, back then I had only tried it with Hive connections. I tried to use it today with an Oracle database connection, and it doesn’t seem to work. Here is a reproducible example in Oracle Big Data Lite VM 4.2.1, using the …

Manipulating Hive tables with Oracle R connectors for Hadoop

Christos - Iraklis Tsatsoulis Hadoop, Hive, Oracle R 0 Comments

In this post, we’ll have a look at how easy it is to manipulate Hive tables using Oracle R connectors for Hadoop (ORCH, presently known as Oracle R Advanced Analytics for Hadoop – ORAAH). We will use the weblog data from Athens Datathon 2015, which we have already loaded in a Hive table named weblogs, as described in more detail …

Log files exploration with Oracle Big Data Discovery 1.1

Christos - Iraklis Tsatsoulis Big Data, Exploratory Data Analysis, Oracle Big Data Discovery 1 Comment

In a previous post, we described how we performed exploratory data analysis (EDA) in real-world log files, as provided by Skroutz.gr, the leading online company in Greece for online price comparison, in the context of Athens Datathon 2015. In the present post we will have a look at the same job as performed with Oracle Big Data Discovery (v. 1.1), …