Hacker List


HackerList Ranksβ„’


Stacks / Topics


nuker πŸ‘¨β€πŸ’»πŸš€ 139

Cleans up AWS resources based on configurable Rules.

Rust
Dockerfile
Shell

⭐3

πŸ––0

nuker

Cleans up AWS resources based on configurable Rules.

Rust
Dockerfile
Shell
πŸ‘¨β€πŸ’»πŸš€ 139 ⭐3 πŸ––0

ankus πŸ‘¨β€πŸ’»πŸš€ 670

ANKUS is a deployment & orchestration tool for big data frameworks

Ruby
Shell

⭐21

πŸ––9

ankus

ANKUS is a deployment & orchestration tool for big data frameworks

Ruby
Shell
πŸ‘¨β€πŸ’»πŸš€ 670 ⭐21 πŸ––9

ankus-modules πŸ‘¨β€πŸ’»πŸš€ 191

modules used by ankus to manage big-data frameworks

Puppet
Ruby
Emacs Lisp

⭐5

πŸ––3

ankus-modules

modules used by ankus to manage big-data frameworks

Puppet
Ruby
Emacs Lisp
πŸ‘¨β€πŸ’»πŸš€ 191 ⭐5 πŸ––3

scripts πŸ‘¨β€πŸ’»πŸš€ 43

Cloudwick Deployment Scripts

Shell
Ruby

⭐2

πŸ––0

scripts

Cloudwick Deployment Scripts

Shell
Ruby
πŸ‘¨β€πŸ’»πŸš€ 43 ⭐2 πŸ––0

LogEventsProcessing πŸ‘¨β€πŸ’»πŸš€ 11

real time log event processing using storm, kafka, logstash & cassandra

Java

⭐47

πŸ––25

LogEventsProcessing

real time log event processing using storm, kafka, logstash & cassandra

Java
πŸ‘¨β€πŸ’»πŸš€ 11 ⭐47 πŸ––25

generator πŸ‘¨β€πŸ’»πŸš€ 42

Synthetic data generators for simulating real-time data and work loads

Shell
Scala
Java

⭐10

πŸ––4

generator

Synthetic data generators for simulating real-time data and work loads

Shell
Scala
Java
πŸ‘¨β€πŸ’»πŸš€ 42 ⭐10 πŸ––4

awscli πŸ‘¨β€πŸ’»πŸš€ 101

amazon web services command line interface in ruby

Ruby

⭐10

πŸ––4

awscli

amazon web services command line interface in ruby

Ruby
πŸ‘¨β€πŸ’»πŸš€ 101 ⭐10 πŸ––4

time-tracker πŸ‘¨β€πŸ’»πŸš€ 41

Play application for management of timesheets, invoices, expense management and others

Scala
HTML
JavaScript
CSS

⭐0

πŸ––0

time-tracker

Play application for management of timesheets, invoices, expense management and others

Scala
HTML
JavaScript
CSS
πŸ‘¨β€πŸ’»πŸš€ 41 ⭐0 πŸ––0

benchmark πŸ‘¨β€πŸ’»πŸš€ 48

Application to benchmark inserts, reads and queries of nosql data-stores

Scala
Shell
Java

⭐12

πŸ––4

benchmark

Application to benchmark inserts, reads and queries of nosql data-stores

Scala
Shell
Java
πŸ‘¨β€πŸ’»πŸš€ 48 ⭐12 πŸ––4

storm-helloworld πŸ‘¨β€πŸ’»πŸš€ 4

sample hello world topology with pom

Java

⭐13

πŸ––25

storm-helloworld

sample hello world topology with pom

Java
πŸ‘¨β€πŸ’»πŸš€ 4 ⭐13 πŸ––25

LogEventsProcessingSpark πŸ‘¨β€πŸ’»πŸš€ 7

real time log event processing using spark, kafka & cassandra

Scala
Java
Shell

⭐13

πŸ––18

LogEventsProcessingSpark

real time log event processing using spark, kafka & cassandra

Scala
Java
Shell
πŸ‘¨β€πŸ’»πŸš€ 7 ⭐13 πŸ––18

scripts πŸ‘¨β€πŸ’»πŸš€ 55

day-to-day dev-ops scripts

Ruby
Shell

⭐1

πŸ––5

scripts

day-to-day dev-ops scripts

Ruby
Shell
πŸ‘¨β€πŸ’»πŸš€ 55 ⭐1 πŸ––5

dotfiles πŸ‘¨β€πŸ’»πŸš€ 57

Dotfiles

Shell
Ruby
Emacs Lisp

⭐3

πŸ––0

dotfiles

Dotfiles

Shell
Ruby
Emacs Lisp
πŸ‘¨β€πŸ’»πŸš€ 57 ⭐3 πŸ––0

bulkload_mongo_mapreduce πŸ‘¨β€πŸ’»πŸš€ 50

simple mapreduce program to bulk load hdfs data into mongodb

Java

⭐0

πŸ––0

bulkload_mongo_mapreduce

simple mapreduce program to bulk load hdfs data into mongodb

Java
πŸ‘¨β€πŸ’»πŸš€ 50 ⭐0 πŸ––0

spark_codebase πŸ‘¨β€πŸ’»πŸš€ 22

Collection of Spark core, streaming, sql, mllib examples & applications with base line unit tests

Scala

⭐6

πŸ––4

spark_codebase

Collection of Spark core, streaming, sql, mllib examples & applications with base line unit tests

Scala
πŸ‘¨β€πŸ’»πŸš€ 22 ⭐6 πŸ––4

spark-starter πŸ‘¨β€πŸ’»πŸš€ 3

Sample Spark start application illustration wordcount with testsuite

Scala

⭐1

πŸ––3

spark-starter

Sample Spark start application illustration wordcount with testsuite

Scala
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐1 πŸ––3

dscb πŸ‘¨β€πŸ’»πŸš€ 16

Distributed Systems Code Base (for training purposes)

Scala
Thrift

⭐3

πŸ––1

dscb

Distributed Systems Code Base (for training purposes)

Scala
Thrift
πŸ‘¨β€πŸ’»πŸš€ 16 ⭐3 πŸ––1

puppet_kerberos πŸ‘¨β€πŸ’»πŸš€ 11

puppet module to install kerberos

Ruby
Puppet

⭐1

πŸ––2

puppet_kerberos

puppet module to install kerberos

Ruby
Puppet
πŸ‘¨β€πŸ’»πŸš€ 11 ⭐1 πŸ––2

index_tweets_solr πŸ‘¨β€πŸ’»πŸš€ 2

Indexes Twitter tweets using Solr

Java
Shell

⭐2

πŸ––0

index_tweets_solr

Indexes Twitter tweets using Solr

Java
Shell
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐2 πŸ––0

kafka_code_base πŸ‘¨β€πŸ’»πŸš€ 3

Simple Kafka producer, consumer examples

Scala
Java

⭐1

πŸ––1

kafka_code_base

Simple Kafka producer, consumer examples

Scala
Java
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐1 πŸ––1

mapreduce_training πŸ‘¨β€πŸ’»πŸš€ 8

Set of MapReduce application's used for teaching purposes

Java

⭐3

πŸ––5

mapreduce_training

Set of MapReduce application's used for teaching purposes

Java
πŸ‘¨β€πŸ’»πŸš€ 8 ⭐3 πŸ––5

log_analytics_mapreduce πŸ‘¨β€πŸ’»πŸš€ 12

Analytics on top of http webserver log events using mapreduce

Java

⭐5

πŸ––6

log_analytics_mapreduce

Analytics on top of http webserver log events using mapreduce

Java
πŸ‘¨β€πŸ’»πŸš€ 12 ⭐5 πŸ––6

ncdc_data_processing πŸ‘¨β€πŸ’»πŸš€ 1

Process/Analyze NCDC weather dataset using hadoop mapreduce

Java
Shell

⭐1

πŸ––5

ncdc_data_processing

Process/Analyze NCDC weather dataset using hadoop mapreduce

Java
Shell
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐1 πŸ––5

cm_automation πŸ‘¨β€πŸ’»πŸš€ 11

Cloudera Manager automation using Puppet, Chef & CM API

Ruby
Puppet

⭐0

πŸ––2

cm_automation

Cloudera Manager automation using Puppet, Chef & CM API

Ruby
Puppet
πŸ‘¨β€πŸ’»πŸš€ 11 ⭐0 πŸ––2

clickstream_generator πŸ‘¨β€πŸ’»πŸš€ 1

Synthetic data generator for generating clickstream data

Scala

⭐1

πŸ––1

clickstream_generator

Synthetic data generator for generating clickstream data

Scala
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐1 πŸ––1

puppet_java πŸ‘¨β€πŸ’»πŸš€ 10

puppet module to install and manage java

Puppet

⭐1

πŸ––1

puppet_java

puppet module to install and manage java

Puppet
πŸ‘¨β€πŸ’»πŸš€ 10 ⭐1 πŸ––1

http_events_gen πŸ‘¨β€πŸ’»πŸš€ 4

mocks http web requests (apache web server format)

Scala
Ruby

⭐1

πŸ––2

http_events_gen

mocks http web requests (apache web server format)

Scala
Ruby
πŸ‘¨β€πŸ’»πŸš€ 4 ⭐1 πŸ––2

puppet_module_scm πŸ‘¨β€πŸ’»πŸš€ 2

puppet module to deploy and manage cloudera manager

Puppet

⭐1

πŸ––2

puppet_module_scm

puppet module to deploy and manage cloudera manager

Puppet
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐1 πŸ––2

⭐0

πŸ––0
Python
Shell
πŸ‘¨β€πŸ’»πŸš€ 19 ⭐0 πŸ––0

s3-restore πŸ‘¨β€πŸ’»πŸš€ 11

Restores deleted objects of an S3 version enabled bucket

Go
Dockerfile

⭐0

πŸ––0

s3-restore

Restores deleted objects of an S3 version enabled bucket

Go
Dockerfile
πŸ‘¨β€πŸ’»πŸš€ 11 ⭐0 πŸ––0

⭐0

πŸ––0
Rust
πŸ‘¨β€πŸ’»πŸš€ 5 ⭐0 πŸ––0

udmp-wpa-supplicant-monitor πŸ‘¨β€πŸ’»πŸš€ 2

Monitors the wpa-supplicant container

Go
Dockerfile

⭐0

πŸ––0

udmp-wpa-supplicant-monitor

Monitors the wpa-supplicant container

Go
Dockerfile
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––0

⭐0

πŸ––1
HTML
CSS
πŸ‘¨β€πŸ’»πŸš€ 9 ⭐0 πŸ––1

play-cloudwickone-slick-template πŸ‘¨β€πŸ’»πŸš€ 1

Play slick (postgres) template

Scala
HTML
JavaScript
CSS

⭐1

πŸ––0

play-cloudwickone-slick-template

Play slick (postgres) template

Scala
HTML
JavaScript
CSS
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐1 πŸ––0

centos-base πŸ‘¨β€πŸ’»πŸš€ 4

Docker CentOS base image with SSH and Supervisor configured

Shell

⭐0

πŸ––1

centos-base

Docker CentOS base image with SSH and Supervisor configured

Shell
πŸ‘¨β€πŸ’»πŸš€ 4 ⭐0 πŸ––1

mapreduce_joins πŸ‘¨β€πŸ’»πŸš€ 6

Simple examples to illustrate joins in mapreduce

Java

⭐0

πŸ––1

mapreduce_joins

Simple examples to illustrate joins in mapreduce

Java
πŸ‘¨β€πŸ’»πŸš€ 6 ⭐0 πŸ––1

game_data_gen πŸ‘¨β€πŸ’»πŸš€ 6

Mocks data generated by gaming website

Ruby

⭐0

πŸ––2

game_data_gen

Mocks data generated by gaming website

Ruby
πŸ‘¨β€πŸ’»πŸš€ 6 ⭐0 πŸ––2

puppet_logstash πŸ‘¨β€πŸ’»πŸš€ 9

puppet module to install & manage logstash, lumberjack

Ruby
Puppet

⭐0

πŸ––1

puppet_logstash

puppet module to install & manage logstash, lumberjack

Ruby
Puppet
πŸ‘¨β€πŸ’»πŸš€ 9 ⭐0 πŸ––1

movie_data_gen πŸ‘¨β€πŸ’»πŸš€ 8

set of programs to generate random movie data set

Ruby

⭐0

πŸ––1

movie_data_gen

set of programs to generate random movie data set

Ruby
πŸ‘¨β€πŸ’»πŸš€ 8 ⭐0 πŸ––1

analytics-game-demo πŸ‘¨β€πŸ’»πŸš€ 6

Analytics demo on game data

R
Ruby
Shell

⭐0

πŸ––1

analytics-game-demo

Analytics demo on game data

R
Ruby
Shell
πŸ‘¨β€πŸ’»πŸš€ 6 ⭐0 πŸ––1

puppet_zookeeper πŸ‘¨β€πŸ’»πŸš€ 6

puppet module to install and manage apache zookeeper

Puppet

⭐0

πŸ––1

puppet_zookeeper

puppet module to install and manage apache zookeeper

Puppet
πŸ‘¨β€πŸ’»πŸš€ 6 ⭐0 πŸ––1

cm_api πŸ‘¨β€πŸ’»πŸš€ 1

illustrates cloudera manager api use with ruby (httparty)

Ruby

⭐0

πŸ––1

cm_api

illustrates cloudera manager api use with ruby (httparty)

Ruby
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––1

index_logs πŸ‘¨β€πŸ’»πŸš€ 1

Scala application to interface with solr for indexing and querying apache http log events

Shell
Scala

⭐0

πŸ––1

index_logs

Scala application to interface with solr for indexing and querying apache http log events

Shell
Scala
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––1

sinatra_delayed_job_active_record πŸ‘¨β€πŸ’»πŸš€ 3

Simple project illustrating how to use Delayed_job active record with Sinatra

Ruby
Shell

⭐0

πŸ––0

sinatra_delayed_job_active_record

Simple project illustrating how to use Delayed_job active record with Sinatra

Ruby
Shell
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐0 πŸ––0

play-cloudwickone-mongo-template πŸ‘¨β€πŸ’»πŸš€ 5

Play MongoDB template

Scala
HTML
JavaScript
CSS

⭐0

πŸ––0
Scala
HTML
JavaScript
CSS
πŸ‘¨β€πŸ’»πŸš€ 5 ⭐0 πŸ––0

cloudwick_one_template πŸ‘¨β€πŸ’»πŸš€ 1

Bootstrap template for Cloudwick One

JavaScript
HTML

⭐0

πŸ––0

cloudwick_one_template

Bootstrap template for Cloudwick One

JavaScript
HTML
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––0

⭐0

πŸ––0

docker-hadoop-worker

Docker Hadoop Worker Image

Shell
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––0

⭐0

πŸ––0

docker-hadoop-master

Hadoop Master Docker Image

Shell
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––0

puppet_module_mongo πŸ‘¨β€πŸ’»πŸš€ 13

Puppet module to manage mongodb

Puppet

⭐0

πŸ––0

puppet_module_mongo

Puppet module to manage mongodb

Puppet
πŸ‘¨β€πŸ’»πŸš€ 13 ⭐0 πŸ––0

sinatra_delayedjob_mongoid πŸ‘¨β€πŸ’»πŸš€ 3

Example usage of sinatra with delayed_job and mongoid

Ruby

⭐0

πŸ––0

sinatra_delayedjob_mongoid

Example usage of sinatra with delayed_job and mongoid

Ruby
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐0 πŸ––0

deb-pkgs πŸ‘¨β€πŸ’»πŸš€ 13

Build debian packages for some big data projects

Shell

⭐0

πŸ––0

deb-pkgs

Build debian packages for some big data projects

Shell
πŸ‘¨β€πŸ’»πŸš€ 13 ⭐0 πŸ––0

puppet_kafka πŸ‘¨β€πŸ’»πŸš€ 6

puppet module to install kafka 0.8

Puppet

⭐0

πŸ––0

puppet_kafka

puppet module to install kafka 0.8

Puppet
πŸ‘¨β€πŸ’»πŸš€ 6 ⭐0 πŸ––0

puppet_storm πŸ‘¨β€πŸ’»πŸš€ 4

puppet module to deploy storm

Shell
Puppet

⭐0

πŸ––0

puppet_storm

puppet module to deploy storm

Shell
Puppet
πŸ‘¨β€πŸ’»πŸš€ 4 ⭐0 πŸ––0

blog-engine πŸ‘¨β€πŸ’»πŸš€ 3

a simple blog engine using mongo & sinatra

Ruby
CSS

⭐0

πŸ––0

blog-engine

a simple blog engine using mongo & sinatra

Ruby
CSS
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐0 πŸ––0

puppet_jmxtrans πŸ‘¨β€πŸ’»πŸš€ 3

puppet module to install & manage jmxtrans

Puppet

⭐0

πŸ––0

puppet_jmxtrans

puppet module to install & manage jmxtrans

Puppet
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐0 πŸ––0

puppet_ganglia πŸ‘¨β€πŸ’»πŸš€ 3

puppet module to install ganglia

Puppet

⭐0

πŸ––0

puppet_ganglia

puppet module to install ganglia

Puppet
πŸ‘¨β€πŸ’»πŸš€ 3 ⭐0 πŸ––0

puppet_module_base πŸ‘¨β€πŸ’»πŸš€ 2

puppet module to manage user's and install zsh(oh-my-zsh) for user

Puppet

⭐0

πŸ––0

puppet_module_base

puppet module to manage user's and install zsh(oh-my-zsh) for user

Puppet
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––0

puppet_scala πŸ‘¨β€πŸ’»πŸš€ 1

puppet module to install scala

Puppet

⭐0

πŸ––0

puppet_scala

puppet module to install scala

Puppet
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––0

tana-readwise-exporter πŸ‘¨β€πŸ’»πŸš€ 1

Export Readwise highlights to Tana

Python

⭐0

πŸ––0

tana-readwise-exporter

Export Readwise highlights to Tana

Python
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––0

tana-readwise-exporter πŸ‘¨β€πŸ’»πŸš€ 1

CLI to export readwise highlights to Tana.io

Go

⭐0

πŸ––0

tana-readwise-exporter

CLI to export readwise highlights to Tana.io

Go
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––0

chef-repo πŸ‘¨β€πŸ’»πŸš€ 16

base for chef code

Ruby
Perl
C

⭐0

πŸ––0

chef-repo

base for chef code

Ruby
Perl
C
πŸ‘¨β€πŸ’»πŸš€ 16 ⭐0 πŸ––0

navi πŸ‘¨β€πŸ’»πŸš€ 1

An interactive cheatsheet tool for the command-line

Shell
Makefile
Rust

⭐10183

πŸ––408

navi

An interactive cheatsheet tool for the command-line

Shell
Makefile
Rust
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐10183 πŸ––408

slack-scala-client πŸ‘¨β€πŸ’»πŸš€ 1

A scala library for interacting with the slack api and real time messaging interface

Scala

⭐180

πŸ––110

slack-scala-client

A scala library for interacting with the slack api and real time messaging interface

Scala
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐180 πŸ––110

⭐0

πŸ––0
πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––0

awesome-bigdata πŸ‘¨β€πŸ’»πŸš€ 2

A curated list of awesome big data frameworks, ressources and other awesomeness.


⭐10553

πŸ––2383

awesome-bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

πŸ‘¨β€πŸ’»πŸš€ 2 ⭐10553 πŸ––2383

docker-hadoop-base πŸ‘¨β€πŸ’»πŸš€ 1

Hadoop base image for Docker built on centos 7


⭐0

πŸ––0

docker-hadoop-base

Hadoop base image for Docker built on centos 7

πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––0

⭐0

πŸ––1
πŸ‘¨β€πŸ’»πŸš€ 1 ⭐0 πŸ––1

hadoop πŸ‘¨β€πŸ’»πŸš€ 16

Set of Hadoop related UseCases


⭐0

πŸ––1

hadoop

Set of Hadoop related UseCases

πŸ‘¨β€πŸ’»πŸš€ 16 ⭐0 πŸ––1

ankus πŸ‘¨β€πŸ’»πŸš€ 371

DEPRICATED. Project maintained at https://github.com/cloudwicklabs/ankus


⭐0

πŸ––1

ankus

DEPRICATED. Project maintained at https://github.com/cloudwicklabs/ankus

πŸ‘¨β€πŸ’»πŸš€ 371 ⭐0 πŸ––1

realtime_processing πŸ‘¨β€πŸ’»πŸš€ 6

Set of UseCases for solving BigData RealTime Processing


⭐1

πŸ––3

realtime_processing

Set of UseCases for solving BigData RealTime Processing

πŸ‘¨β€πŸ’»πŸš€ 6 ⭐1 πŸ––3

datagenerators πŸ‘¨β€πŸ’»πŸš€ 11

[DEPRICATED] Use https://github.com/cloudwicklabs/generator


⭐0

πŸ––2

datagenerators

[DEPRICATED] Use https://github.com/cloudwicklabs/generator

πŸ‘¨β€πŸ’»πŸš€ 11 ⭐0 πŸ––2

rpm-specs πŸ‘¨β€πŸ’»πŸš€ 14

rpm spec files for various big data projects


⭐2

πŸ––1

rpm-specs

rpm spec files for various big data projects

πŸ‘¨β€πŸ’»πŸš€ 14 ⭐2 πŸ––1

flume_filtering πŸ‘¨β€πŸ’»πŸš€ 2

Filter HTTP log events based on status_codes using Interceptors and ChannelSelectors


⭐0

πŸ––6

flume_filtering

Filter HTTP log events based on status_codes using Interceptors and ChannelSelectors

πŸ‘¨β€πŸ’»πŸš€ 2 ⭐0 πŸ––6
Question Score: 91
Answer Score: 116
πŸ‘€ 312859

hadoop copy a local file system folder to HDFS

hadoop
hdfs
accepted answer βœ…
Question Score: 28
Answer Score: 73
πŸ‘€ 43621

hadoop fs -ls results in "no such file or directory"

hadoop
uri
hdfs
accepted answer βœ…
Question Score: 34
Answer Score: 55
πŸ‘€ 35833

How can I access S3/S3n from a local Hadoop 2.6 installation?

hadoop
amazon-web-services
amazon-s3
hadoop-yarn
hadoop2
accepted answer βœ…
Question Score: 11
Answer Score: 25
πŸ‘€ 7525

namespace image and edit log

hadoop
hdfs
hadoop2
accepted answer βœ…
Question Score: 3
Answer Score: 16
πŸ‘€ 1954

Can Apache Sqoop and Flume be used interchangeably?

hadoop
bigdata
sqoop
flume
Question Score: 6
Answer Score: 14
πŸ‘€ 24546

What is the maximum container(s) in a single-node cluster (hadoop)?

apache
hadoop
mapreduce
hadoop-yarn
hadoop2
accepted answer βœ…
Question Score: 10
Answer Score: 12
πŸ‘€ 15800

hadoop fs -text file returns "text: Unable to write to output stream."

hadoop
accepted answer βœ…
Question Score: 3
Answer Score: 11
πŸ‘€ 3616

Data lost after shutting down hadoop HDFS?

hadoop
hdfs
Question Score: 359
Answer Score: 10
πŸ‘€ 300761

Getting output of system() calls in Ruby

ruby
system
call
accepted answer βœ…
Question Score: 7
Answer Score: 10
πŸ‘€ 14584

Host and port to use to list a directory in hdfs

java
hadoop
hdfs
hortonworks-data-platform
accepted answer βœ…
Question Score: 5
Answer Score: 10
πŸ‘€ 20190

How to upload file to HDFS in Ubuntu

hadoop
hdfs
Question Score: 8
Answer Score: 10
πŸ‘€ 16709

Different ways to import files into HDFS

hadoop
import
hdfs
accepted answer βœ…
Question Score: 1
Answer Score: 10
πŸ‘€ 7385

relation between number of input splits and number of mappers in mapreduce hadoop

hadoop
mapreduce
accepted answer βœ…
Question Score: 5
Answer Score: 9
πŸ‘€ 15282

Standard practices for logging in MapReduce jobs

java
hadoop
mapreduce
hadoop2
mapr
Question Score: 9
Answer Score: 8
πŸ‘€ 5629

Accessing a file that is being written

hadoop
hdfs
accepted answer βœ…
Question Score: 10
Answer Score: 8
πŸ‘€ 23701

Opening a file stored in HDFS to edit in VI

ubuntu
hadoop
hdfs
vi
accepted answer βœ…
Question Score: 5
Answer Score: 7
πŸ‘€ 7901

mapreduce in java - gzip input files

java
hadoop
mapreduce
gzip
accepted answer βœ…
Question Score: 3
Answer Score: 7
πŸ‘€ 8601

MapReduce: How to get mapper to process multiple lines?

java
hadoop
input
split
mapreduce
Question Score: 6
Answer Score: 6
πŸ‘€ 9353

MapReduce or Spark for Batch processing on Hadoop?

hadoop
mapreduce
batch-processing
apache-spark
accepted answer βœ…
Question Score: 4
Answer Score: 6
πŸ‘€ 1546

HBase: How does data get written in a sorted manner into HFile?

hbase
hfile
Question Score: 3
Answer Score: 6
πŸ‘€ 17143

how to start and check job history on hadoop 2.5.2

hadoop
Question Score: 1
Answer Score: 6
πŸ‘€ 6635

Complete list of property that is used in Hadoop framework

java
hadoop
dictionary
mapreduce
hdfs
Question Score: 11
Answer Score: 6
πŸ‘€ 52925

Where is the classpath set for hadoop

hadoop
mapreduce
hadoop2
accepted answer βœ…
Question Score: 2
Answer Score: 5
πŸ‘€ 3197

Hadoop: Getting the input file name in the mapper only once

hadoop
mapreduce
accepted answer βœ…
Question Score: 0
Answer Score: 5
πŸ‘€ 2212

MRv2 / YARN Features

hadoop
mrv2
Question Score: 14
Answer Score: 5
πŸ‘€ 14069

.sparkstaging directory in hdfs is not deleted

apache-spark
accepted answer βœ…
Question Score: 2
Answer Score: 4
πŸ‘€ 1310

Hadoop own data types

hadoop
types
Question Score: 3
Answer Score: 3
πŸ‘€ 1109

How should I persist my event stream to cold storage?

hadoop
bigdata
apache-kafka
amazon-kinesis
azure-eventhub
accepted answer βœ…
Question Score: 3
Answer Score: 3
πŸ‘€ 6881

How to get data from HDFS? Hive?

hadoop
hive
accepted answer βœ…
Question Score: 0
Answer Score: 3
πŸ‘€ 560

What is the -file argument for AWS EMR

hadoop
amazon-web-services
amazon-emr