emr
Here are 17 public repositories matching this topic...
A boilerplate for spark projects with docker support for local development and scripts for emr support.
-
Updated
Dec 2, 2017 - Scala
Link Prediction is about predicting the future connections in a graph. In this project, Link Prediction is about predicting whether two authors will be collaborating for their future paper or not given the graph of authors who collaborated for atleast one paper together.
-
Updated
Dec 10, 2019 - Scala
Hadoop Map Reduce
-
Updated
Nov 17, 2021 - Scala
Hadoop MapReduce Programs using Scala to process log files.
-
Updated
Nov 16, 2021 - Scala
Offline Elasticsearch index generator
-
Updated
Jun 5, 2019 - Scala
Infrastructure: The projects herein simplify the repeated use of a variety of frameworks, and cloud services & platforms.
-
Updated
Nov 4, 2023 - Scala
Group 10 Project, Fall 2020, CS 6240: Large-Scale Parallel Data Processing, Khoury College of Computer Sciences, Northeastern University
-
Updated
Jul 12, 2021 - Scala
Parsing the common crawl database using Scala and Spark
-
Updated
Mar 17, 2021 - Scala
Spark code used for my Master's Thesis. Run on AWS EMR clusters
-
Updated
Feb 27, 2018 - Scala
Half-baked implementation of a cluster manager for EMR.
-
Updated
May 15, 2020 - Scala
Improve this page
Add a description, image, and links to the emr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the emr topic, visit your repo's landing page and select "manage topics."