As we are approaching the end of 2017, many people have resolutions or goals for the new year. How about a goal to get organized...in your data lake?
The most important aspect of organizing a data lake is optimal data retrieval.
It all starts with the zones of your data lake, as shown in the following diagram:
Hopefully the above diagram is a helpful starting place when planning a data lake structure. I have used all of the above zones in projects (with the exception of a transient zone which I haven't had a requirement for).
You Might Also Like...
Data Lake Use Cases and Planning Considerations <--More tips on organizing the data lake in this post