Skip to content

Apache Accumulo - AWS distribution based on Apache Hadoop, Apache ZooKeeper, CentOS 7 and Oracle Java 8

Notifications You must be signed in to change notification settings

kthare10/accumulo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

accumulo

Apache Accumulo in AWS

This work is inspired by:

  1. Exogeni Recipies
  2. Accumulo in Docker

What Is Apache Accumulo?

Apache Accumulo is a key/value store based on the design of Google's BigTable. Accumulo stores its data in Apache Hadoop's HDFS and uses Apache Zookeeper for consensus. While many users interact directly with Accumulo, several open source projects use Accumulo as their underlying store.

See official documentation for more information.

Accumulo Cluster

Cluster

How to use this cloudformation?

Pre-requisites

  1. User must have AWS account with privilges to create/delete IAMRole, IAMPolicy and IAMProfile
  2. Key pair has been created
  3. Change the value of ACCUMULO_PASSWORD in setupaccumulo.sh. Default value is 'secret'

Create a Accumulo Stack

Create a stack on AWS Cloudformation service by using accumuloCloudFormation.json.

Logon to AWS Console and Search for Cloudformation service

Cloudformation

Click Create

Cloudformation

Choose accumuloCloudFormation.json and click Next

Cloudformation

Specify the Stack name and KeyPair and click Next

Cloudformation

Click Next

Cloudformation

Ensure the checkbox for IAMRole warning is checked and click Create

Cloudformation

Stack creation will begin and status will be displayed as below

Cloudformation

namenode instance: NameNode Web UI on port 50070

NameNode: http://[PublicIPv4 of Instance]:50070/dfshealth.html#tab-datanode

NameNode

resourcemanager instance: ResourceManager Web UI on port 8088

ResourceManager: http://[PublicIPv4 of Instance]:8088

ResourceManager

accumulomaster instance: Accumulomaster Web UI on port 9995

Accumulomaster: http://[PublicIPv4 of Instance]:9995

Accumulomaster

worker instance: Worker Web UI on port 9995

Worker: http://[PublicIPv4 of Instance]:9995

Worker

Test Cluster

NOTE: Assumes the cluster is running as configured.

A script named usertable-example.sh will create a sample usertable in Accumulo using 100 randomly generated user entries.

This script should be executed on accumulomaster console as root user.

Accumulo stack hacks

Launch Accumulo with one worker

  1. Edit accumuloCloudFormation.json and remove the section for worker2 line 600-705
  2. Replace worker2 with empty string in accumuloCloudFormation.json

Launch Multiple Accumulo stacks

  1. Make copy of [accumuloCloudFormation.json](.../master/accumuloCloudFormation.json]
  2. Replace bucket name i.e. cometbucket in accumuloCloudFormation.json to a different name in one of the files

About

Apache Accumulo - AWS distribution based on Apache Hadoop, Apache ZooKeeper, CentOS 7 and Oracle Java 8

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages