Data Engineering


DATA ENGINEERING


Linux Fundamentals

  • Linux commands

  • Linux Filesystem structure

  • Directory related operations

  • Permissions

  • Cron jobs

  • Simple Software installation process (java, maven)

  • Environment, Path variables settings


SQL AND NoSQL

SQL - MYSQL

  • CRUD operations

  • JOINS, UNION

  • FILTERS, Aggregate functions

NoSQL - Dynamo

  • SQL vs NoSQL

  • CRUD Operations

  • Scan with filters


Programming with Python

  • Data Structures(List, Dictionaries, Tuples)

  • Loops

  • Functions

  • Dataframes

  • Numpy

  • Visualisation Matplotlib


Big Data by Capstone Project

  • Hadoop

  • Hive

  • Pig

  • Spark

Aws Cloud

  • S3

  • EC2

  • EMR

  • IAMs


Off-Track Guest Lectures

Developer

  • REST APIs

DevOps

  • GIT

  • CICD Pipelines

Trending Technologies

  • Kafka