The Ten Commandments of Hadoop (Work In Progress – feel free to edit)
- Thy namenode shall always persist (We will have multiple recovery methods)
- We accept data as is (Come as you are)
- We love and expect metadata (Nothing enters, changes, or exits without metadata)
- The family tree will always be maintained. (We will always keep data lineage, even if the data is dead)
- Entry requires data classification
- Security/Risk
- Personal
- Public
- Private
- Confidential
- HIPAA
- etc……
- Business/Purpose
- Marketing
- Inventory
- Sales
- Security
- etc
- We will keep the end in mind
- Data will be time stamped as it enters
- Data will have an expiration estimate
- We will bring in as many layers of data as possible
- We will make the data extensible
- Security is integral to the design
- We will create security groups and appropriate controls for each type of data
- You will be authenticated and authorized before access is granted
This entry was posted in Uncategorized. Bookmark the
permalink.