Hacker List


HackerList Ranks™


Stacks / Topics


nuker 👨‍💻🚀 139

Cleans up AWS resources based on configurable Rules.

Rust
Dockerfile
Shell
139
3
0

ankus 👨‍💻🚀 670

ANKUS is a deployment & orchestration tool for big data frameworks

Ruby
Shell
670
21
9

ankus-modules 👨‍💻🚀 191

modules used by ankus to manage big-data frameworks

Puppet
Ruby
Emacs Lisp
191
5
3

scripts 👨‍💻🚀 43

Cloudwick Deployment Scripts

Shell
Ruby
43
2
0

LogEventsProcessing 👨‍💻🚀 11

real time log event processing using storm, kafka, logstash & cassandra

Java
11
47
25

generator 👨‍💻🚀 42

Synthetic data generators for simulating real-time data and work loads

Shell
Scala
Java
42
10
4

awscli 👨‍💻🚀 101

amazon web services command line interface in ruby

Ruby
101
10
4

time-tracker 👨‍💻🚀 41

Play application for management of timesheets, invoices, expense management and others

Scala
HTML
JavaScript
CSS
41
0
0

benchmark 👨‍💻🚀 48

Application to benchmark inserts, reads and queries of nosql data-stores

Scala
Shell
Java
48
12
4

storm-helloworld 👨‍💻🚀 4

sample hello world topology with pom

Java
4
13
25

LogEventsProcessingSpark 👨‍💻🚀 7

real time log event processing using spark, kafka & cassandra

Scala
Java
Shell
7
13
18

dotfiles 👨‍💻🚀 61

Dotfiles

Shell
Ruby
Emacs Lisp
61
3
0

scripts 👨‍💻🚀 55

day-to-day dev-ops scripts

Ruby
Shell
55
1
5

bulkload_mongo_mapreduce 👨‍💻🚀 50

simple mapreduce program to bulk load hdfs data into mongodb

Java
50
0
0

spark_codebase 👨‍💻🚀 22

Collection of Spark core, streaming, sql, mllib examples & applications with base line unit tests

Scala
22
6
4

spark-starter 👨‍💻🚀 3

Sample Spark starter application illustrating wordcount with a test suite

Scala
3
1
3

dscb 👨‍💻🚀 16

Distributed Systems Code Base (for training purposes)

Scala
Thrift
16
3
1

puppet_kerberos 👨‍💻🚀 11

puppet module to install kerberos

Ruby
Puppet
11
1
2

index_tweets_solr 👨‍💻🚀 2

Indexes Twitter tweets using Solr

Java
Shell
2
2
0

kafka_code_base 👨‍💻🚀 3

Simple Kafka producer, consumer examples

Scala
Java
3
1
1

mapreduce_training 👨‍💻🚀 8

Set of MapReduce applications used for teaching purposes

Java
8
3
5

log_analytics_mapreduce 👨‍💻🚀 12

Analytics on top of http webserver log events using mapreduce

Java
12
5
6

ncdc_data_processing 👨‍💻🚀 1

Process/Analyze NCDC weather dataset using hadoop mapreduce

Java
Shell
1
1
5

cm_automation 👨‍💻🚀 11

Cloudera Manager automation using Puppet, Chef & CM API

Ruby
Puppet
11
0
2

clickstream_generator 👨‍💻🚀 1

Synthetic data generator for generating clickstream data

Scala
1
1
1

puppet_java 👨‍💻🚀 10

puppet module to install and manage java

Puppet
10
1
1

http_events_gen 👨‍💻🚀 4

mocks http web requests (apache web server format)

Scala
Ruby
4
1
2

puppet_module_scm 👨‍💻🚀 2

puppet module to deploy and manage cloudera manager

Puppet
2
1
2

19
0
0
Python
Shell
19
0
0

s3-restore 👨‍💻🚀 11

Restores deleted objects of an S3 version enabled bucket

Go
Dockerfile
11
0
0

5
0
0
Rust
5
0
0

udmp-wpa-supplicant-monitor 👨‍💻🚀 2

Monitors the wpa-supplicant container

Go
Dockerfile
2
0
0

9
0
1
HTML
CSS
9
0
1

play-cloudwickone-slick-template 👨‍💻🚀 1

Play slick (postgres) template

Scala
HTML
JavaScript
CSS
1
1
0

centos-base 👨‍💻🚀 4

Docker CentOS base image with SSH and Supervisor configured

Shell
4
0
1

mapreduce_joins 👨‍💻🚀 6

Simple examples to illustrate joins in mapreduce

Java
6
0
1

game_data_gen 👨‍💻🚀 6

Mocks data generated by gaming website

Ruby
6
0
2

puppet_logstash 👨‍💻🚀 9

puppet module to install & manage logstash, lumberjack

Ruby
Puppet
9
0
1

movie_data_gen 👨‍💻🚀 8

set of programs to generate random movie data set

Ruby
8
0
1

analytics-game-demo 👨‍💻🚀 6

Analytics demo on game data

R
Ruby
Shell
6
0
1

puppet_zookeeper 👨‍💻🚀 6

puppet module to install and manage apache zookeeper

Puppet
6
0
1

cm_api 👨‍💻🚀 1

illustrates cloudera manager api use with ruby (httparty)

Ruby
1
0
1

index_logs 👨‍💻🚀 1

Scala application to interface with solr for indexing and querying apache http log events

Shell
Scala
1
0
1

sinatra_delayed_job_active_record 👨‍💻🚀 3

Simple project illustrating how to use Delayed_job active record with Sinatra

Ruby
Shell
3
0
0

play-cloudwickone-mongo-template 👨‍💻🚀 5

Play MongoDB template

Scala
HTML
JavaScript
CSS
5
0
0

cloudwick_one_template 👨‍💻🚀 1

Bootstrap template for Cloudwick One

JavaScript
HTML
1
0
0


docker-hadoop-worker

Docker Hadoop Worker Image

Shell
2
0
0

docker-hadoop-master

Hadoop Master Docker Image

Shell
2
0
0

puppet_module_mongo 👨‍💻🚀 13

Puppet module to manage mongodb

Puppet
13
0
0

sinatra_delayedjob_mongoid 👨‍💻🚀 3

Example usage of sinatra with delayed_job and mongoid

Ruby
3
0
0

deb-pkgs 👨‍💻🚀 13

Build debian packages for some big data projects

Shell
13
0
0

puppet_kafka 👨‍💻🚀 6

puppet module to install kafka 0.8

Puppet
6
0
0

puppet_storm 👨‍💻🚀 4

puppet module to deploy storm

Shell
Puppet
4
0
0

blog-engine 👨‍💻🚀 3

a simple blog engine using mongo & sinatra

Ruby
CSS
3
0
0

puppet_jmxtrans 👨‍💻🚀 3

puppet module to install & manage jmxtrans

Puppet
3
0
0

puppet_ganglia 👨‍💻🚀 3

puppet module to install ganglia

Puppet
3
0
0

puppet_module_base 👨‍💻🚀 2

puppet module to manage users and install zsh (oh-my-zsh) for each user

Puppet
2
0
0

puppet_scala 👨‍💻🚀 1

puppet module to install scala

Puppet
1
0
0

tana-readwise-exporter 👨‍💻🚀 1

Export Readwise highlights to Tana

Python
1
0
0

tana-readwise-exporter 👨‍💻🚀 1

CLI to export readwise highlights to Tana.io

Go
1
0
0

chef-repo 👨‍💻🚀 16

base for chef code

Ruby
Perl
C
16
0
0

navi 👨‍💻🚀 1

An interactive cheatsheet tool for the command-line

Shell
Makefile
Rust
1
10183
408

slack-scala-client 👨‍💻🚀 1

A scala library for interacting with the slack api and real time messaging interface

Scala
1
180
110

2
0
0
2
0
0

awesome-bigdata 👨‍💻🚀 2

A curated list of awesome big data frameworks, resources and other awesomeness.

2
10553
2383

docker-hadoop-base 👨‍💻🚀 1

Hadoop base image for Docker built on centos 7

1
0
0

1
0
1
1
0
1

hadoop 👨‍💻🚀 16

Set of Hadoop related UseCases

16
0
1

ankus 👨‍💻🚀 371

DEPRECATED. Project maintained at https://github.com/cloudwicklabs/ankus

371
0
1

realtime_processing 👨‍💻🚀 6

Set of UseCases for solving BigData RealTime Processing

6
1
3

datagenerators 👨‍💻🚀 11

[DEPRECATED] Use https://github.com/cloudwicklabs/generator

11
0
2

rpm-specs 👨‍💻🚀 14

rpm spec files for various big data projects

14
2
1

flume_filtering 👨‍💻🚀 2

Filter HTTP log events based on status_codes using Interceptors and ChannelSelectors

2
0
6

Question Score: 92
Answer Score: 118
👀 322930

hadoop copy a local file system folder to HDFS

hadoop
hdfs
accepted answer ✅
Question Score: 28
Answer Score: 73
👀 44555

hadoop fs -ls results in "no such file or directory"

hadoop
uri
hdfs
accepted answer ✅
Question Score: 34
Answer Score: 55
👀 36135

How can I access S3/S3n from a local Hadoop 2.6 installation?

hadoop
amazon-web-services
amazon-s3
hadoop-yarn
hadoop2
accepted answer ✅
Question Score: 11
Answer Score: 25
👀 7613

namespace image and edit log

hadoop
hdfs
hadoop2
accepted answer ✅
Question Score: 3
Answer Score: 16
👀 1965

Can Apache Sqoop and Flume be used interchangeably?

hadoop
bigdata
sqoop
flume
Question Score: 6
Answer Score: 14
👀 24760

What is the maximum container(s) in a single-node cluster (hadoop)?

apache
hadoop
mapreduce
hadoop-yarn
hadoop2
accepted answer ✅
Question Score: 10
Answer Score: 12
👀 16129

hadoop fs -text file returns "text: Unable to write to output stream."

hadoop
accepted answer ✅
Question Score: 5
Answer Score: 11
👀 21661

How to upload file to HDFS in Ubuntu

hadoop
hdfs
accepted answer ✅
Question Score: 3
Answer Score: 11
👀 3733

Data lost after shutting down hadoop HDFS?

hadoop
hdfs
Question Score: 361
Answer Score: 10
👀 304251

Getting output of system() calls in Ruby

ruby
system
call
accepted answer ✅
Question Score: 7
Answer Score: 10
👀 14874

Host and port to use to list a directory in hdfs

java
hadoop
hdfs
hortonworks-data-platform
Question Score: 8
Answer Score: 10
👀 16963

Different ways to import files into HDFS

hadoop
import
hdfs
accepted answer ✅
Question Score: 1
Answer Score: 10
👀 7535

relation between number of input splits and number of mappers in mapreduce hadoop

hadoop
mapreduce
accepted answer ✅
Question Score: 5
Answer Score: 9
👀 15503

Standard practices for logging in MapReduce jobs

java
hadoop
mapreduce
hadoop2
mapr
Question Score: 9
Answer Score: 8
👀 5751

Accessing a file that is being written

hadoop
hdfs
accepted answer ✅
Question Score: 10
Answer Score: 8
👀 24081

Opening a file stored in HDFS to edit in VI

ubuntu
hadoop
hdfs
vi
accepted answer ✅
Question Score: 5
Answer Score: 7
👀 7974

mapreduce in java - gzip input files

java
hadoop
mapreduce
gzip
accepted answer ✅
Question Score: 3
Answer Score: 7
👀 8673

MapReduce: How to get mapper to process multiple lines?

java
hadoop
input
split
mapreduce
Question Score: 6
Answer Score: 6
👀 9500

MapReduce or Spark for Batch processing on Hadoop?

hadoop
mapreduce
batch-processing
apache-spark
accepted answer ✅
Question Score: 4
Answer Score: 6
👀 1572

HBase: How does data get written in a sorted manner into HFile?

hbase
hfile
Question Score: 3
Answer Score: 6
👀 17629

how to start and check job history on hadoop 2.5.2

hadoop
Question Score: 1
Answer Score: 6
👀 6827

Complete list of property that is used in Hadoop framework

java
hadoop
dictionary
mapreduce
hdfs
Question Score: 11
Answer Score: 6
👀 54292

Where is the classpath set for hadoop

hadoop
mapreduce
hadoop2
accepted answer ✅
Question Score: 2
Answer Score: 5
👀 3204

Hadoop: Getting the input file name in the mapper only once

hadoop
mapreduce
accepted answer ✅
Question Score: 0
Answer Score: 5
👀 2248

MRv2 / YARN Features

hadoop
mrv2
Question Score: 14
Answer Score: 5
👀 14713

.sparkstaging directory in hdfs is not deleted

apache-spark
accepted answer ✅
Question Score: 2
Answer Score: 4
👀 1319

Hadoop own data types

hadoop
types
Question Score: 3
Answer Score: 3
👀 1143

How should I persist my event stream to cold storage?

hadoop
bigdata
apache-kafka
amazon-kinesis
azure-eventhub
accepted answer ✅
Question Score: 3
Answer Score: 3
👀 7156

How to get data from HDFS? Hive?

hadoop
hive
accepted answer ✅
Question Score: 0
Answer Score: 3
👀 581

What is the -file argument for AWS EMR

hadoop
amazon-web-services
amazon-emr