📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Updated Mar 20, 2017 - Scala
MapReduce in Node.js
PageRank on Hadoop
MapReduce jobs in Python that extract insights from NYC Taxi data
MapReduce framework based on Storm that is flexible enough for any MapReduce work. Built with a number of workers and a single master. Uses BerkeleyDB as temporary data storage when processing big data.
Recommends movies to users based on their profiles and the ratings of other users.
MapReduce job development, RDD programming, medical data management, sales analysis, and efficient data integration for big data analysis. Spark: big data processing, Sqoop integration, and Spark Structured Streaming for real-time data.
Performed business operations using big data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Sqoop, Apache HBase, MapReduce
MapReduce concepts: secondary sort, counters, multiple MapReduce jobs
Counts the number of times each word occurs in a 1 GB dataset of books using Hadoop MapReduce
Big data technologies that I have experimented with
Cloud and big data 2017/2018: Programming Assignments
Beta versions/student projects
Hadoop jobs written in Go and run using Hadoop in Docker containers
Design and implementation of different MapReduce jobs used to analyze a COVID-19 dataset created by Our World in Data
Cloud computing coursework on big data and related topics
Big Data, Hadoop, and MapReduce in Python. MapReduce Jobs using the MRJob library & Amazon Elastic MapReduce service.
Hadoop MapReduce jobs that derive statistics from the Yelp dataset
Big Data Processing and Analytics course term project.
Run a Hadoop cluster within Docker containers (sequenceiq/hadoop-docker image)