This is the deck for a presentation I had the pleasure of presenting in multiple forums over the last year. It’s a short introduction for Oracle personnel (DBAs and DB developers) to Big Data challenges and solutions. The presentation focuses on the Hadoop ecosystem but also shows other solutions, such as Apache Spark.
These are things every DBA needs to know, not EVERYTHING a DBA needs to know. This is only an introduction to the subject. I also have a 200+ slide deck for a more in-depth view. If anyone finds this interesting and wants to read more, feel free to contact me and I’ll post the longer deck as well.
On the agenda:
- What is the Big Data challenge?
- A Big Data Solution: Apache Hadoop
- MapReduce and YARN
- Hadoop Ecosystem: HBase, Sqoop, Hive, Pig and other tools
- Another Big Data Solution: Apache Spark
- Where does the DBA fit in?
I gave this presentation at the BGOUG 2016, ILOUG Tech Days 2016, HROUG 2016 and DOAG 2016 Oracle user group events. I also presented it in smaller, more private sessions.