Things Every Oracle DBA Needs To Know About The Hadoop Ecosystem (slides)

This is the deck for a presentation I had the pleasure to present in multiple forums over the last year. It’s a short introduction for Oracle personal (DBAs and DB Developers) to the Big Data challenges and solutions. This presentation is focusing on the Hadoop Ecosystem but also shows other solutions – such as Apache Spark.

This is things every DBA needs to know, and not EVERYTHING a DBA needs to know. This is only an introductory to the subject. I also have a 200+ slides deck for getting the in depth view. If someone find this interesting and want to read more, feel free to contact me and I’ll post the longer deck as well.

In the agenda:

  • What is the Big Data challenge?
  • A Big Data Solution: Apache Hadoop
    • HDFS
    • MapReduce and YARN
  • Hadoop Ecosystem: HBase, Sqoop, Hive, Pig and other tools
  • Another Big Data Solution: Apache Spark
  • Where does the DBA fits in?

This presentation was presented in BGOUG 2016, ILOUG Tech Days 2016, HROUG 2016 and DOAG 2016 Oracle user groups. I also presented this in smaller, more private sessions.

Read more

Big Data for CIOs Presentation

A few months ago I was asked to give a two hours lecture to a group of CIOs. The topic was a bit vague – “Introduction to Big Data and NoSQL” but I agreed to give it a try anyway.

Since I feel Big Data is such a big topic and since I really wanted to give the CIO so added value, I created this presentation. The aim of the presentation wasn’t to cover all the technological aspects of the topic, but to give some overview and pointers for the future. We talked about basic principles, issues that needs tackling, and solution that might be relevant in the near future. We also talked about NoSQL in order to understand the relation between RDBMS based solution and other kind of solutions.
Read more

Oracle Pre-Built Developer VMs and VMBox

Virtual machines (VM) are not new –it has been around for quite some time, and as a consultant I find myself use them all the time. As a matter of fact, just on my laptop and external drive there are at least 15 or 20 different virtual environment which I use for testing, experimenting, and for creating new blog posts.

The thing with virtual machines that you need to be a little more than just a simple DBA to set it up – you need to know how to install an operating system, configure storage, and get your system ready for database installation, which many junior and less experienced DBAs find problematic at times.

Well, no more! Oracle comes to the rescue and provide us with pre-build developer virtual machines.

Read more

hadoop logo

Introduction to Big Data, Hadoop and NoSQL

In the last few weeks I participated in the training of a DBA course in John Bryce education center in Israel.

The course is titled “Master DBA” – it’s an 8 month evening course to train new DBAs from head to tail. It’s divided into two parts; the first part is about SQL, PL/SQL, and OU “Oracle Database Administration Workshop” parts 1 and 2. The students are then encouraged to take the OCA and OCP certification exams. After finishing this part, the students learn Linux, scripting and other IT skills a DBA should have.

In the second half of the course, the students learn about Big Data (mostly Hadoop), Python (as a programming language) and NoSQL (MongoDB or CouchBase, not really sure).

Yesterday I gave a class about the introduction to Big Data, Hadoop, and NoSQL.

Read more