Practical Implementation of a Data Lake
26,99 €
Sofort verfügbar, Lieferzeit: Sofort lieferbar
Practical Implementation of a Data Lake, Apress
Translating Customer Expectations into Tangible Technical Goals
Von Nayanjyoti Paul, im heise Shop in digitaler Fassung erhältlich
Produktinformationen "Practical Implementation of a Data Lake"
This book explains how to implement a data lake strategy, covering the technical
and business challenges architects commonly face. It also illustrates how and
why client requirements should drive architectural decisions.
Drawing upon a specific case from his own experience, author Nayanjyoti Paul
begins with the consideration from which all subsequent decisions should flow:
what does your customer need? He also describes the importance of identifying
key stakeholders and the key points to focus on when starting a new project.
Next, he takes you through the business and technical requirement-gathering
process, and how to translate customer expectations into tangible technical
goals. From there, you’ll gain insight into the security model that will allow
you to establish security and legal guardrails, as well as different aspects of
security from the end user’s perspective. You’ll learn which organizational
roles need to be onboarded into the data lake, their responsibilities, the
services they need access to, and how the hierarchy of escalations should work.
Subsequent chapters explore how to divide your data lakes into zones, organize
data for security and access, manage data sensitivity, and techniques used for
data obfuscation. Audit and logging capabilities in the data lake are also
covered before a deep dive into designing data lakes to handle multiple kinds
and file formats and access patterns. The book concludes by focusing on
production operationalization and solutions to implement a production setup.
After completing this book, you will understand how to implement a data lake,
the best practices to employ while doing so, and will be armed with practical
tips to solve business problems.
You will:
* Understand the challenges associated with implementing a data lake
* Explore the architectural patterns and processes used to design a new data
lake
* Design and implement data lake capabilities
* Associate business requirements with technical deliverables to drive success
This book explains how to implement a data lake strategy, covering the technical
and business challenges architects commonly face. It also illustrates how and
why client requirements should drive architectural decisions.
Drawing upon a specific case from his own experience, author Nayanjyoti Paul
begins with the consideration from which all subsequent decisions should flow:
what does your customer need? He also describes the importance of identifying
key stakeholders and the key points to focus on when starting a new project.
Next, he takes you through the business and technical requirement-gathering
process, and how to translate customer expectations into tangible technical
goals. From there, you’ll gain insight into the security model that will allow
you to establish security and legal guardrails, as well as different aspects of
security from the end user’s perspective. You’ll learn which organizational
roles need to be onboarded into the data lake, their responsibilities, the
services they need access to, and how the hierarchy of escalations should work.
Subsequent chapters explore how to divide your data lakes into zones, organize
data for security and access, manage data sensitivity, and techniques used for
data obfuscation. Audit and logging capabilities in the data lake are also
covered before a deep dive into designing data lakes to handle multiple kinds
and file formats and access patterns. The book concludes by focusing on
production operationalization and solutions to implement a production setup.
After completing this book, you will understand how to implement a data lake,
the best practices to employ while doing so, and will be armed with practical
tips to solve business problems.
What You Will Learn
* Understand the challenges associated with implementing a data lake
* Explore the architectural patterns and processes used to design a new data
lake
* Design and implement data lake capabilities
* Associate business requirements with technical deliverables to drive success
Who This Book Is For
Data Scientists and Architects, Machine Learning Engineers, and Software
Engineers.
Nayanjyoti Paul is an Associate Director and Chief Azure Architect for GenAI and
LLM CoE for Accenture. He is the product owner and creator of a patented asset.
Presently, he leads multiple projects as a lead architect around generative AI ,
large language models, data analytics, and machine learning. Nayan is a
certified Master Technology Architect, certified Data Scientist, and certified
Databricks Champion with additional AWS and Azure certifications. He is a
speaker at conferences like Strata Conference, Data Works Summit, and AWS
Reinvent. He also delivers guest lectures at Universities.
Chapter 1: Understanding the Customer Needs.- Chapter 2: Security Model.-
Chapter 3: Organizational Model.- Chapter 4: Data Lake Structure.- Chapter 5:
Production Playground.- Chapter 6: Production Operationalization.- Chapter 7:
Miscellaneous.
Artikel-Details
- Anbieter:
- Apress
- Autor:
- Nayanjyoti Paul
- Artikelnummer:
- 9781484297353
- Veröffentlicht:
- 03.10.23
Barrierefreiheit
This PDF does not fully comply with PDF/UA standards, but does feature limited screen reader support, described non-text content (images, graphs), bookmarks for easy navigation and searchable, selecta
- keine Vorlesefunktionen des Lesesystems deaktiviert (bis auf) (10)
- navigierbares Inhaltsverzeichnis (11)
- logische Lesereihenfolge eingehalten (13)
- kurze Alternativtexte (z.B für Abbildungen) vorhanden (14)
- Inhalt auch ohne Farbwahrnehmung verständlich dargestellt (25)
- hoher Kontrast zwischen Text und Hintergrund (26)
- Navigation über vor-/zurück-Elemente (29)
- alle zum Verständnis notwendigen Inhalte über Screenreader zugänglich (52)
- Kontakt zum Herausgeber für weitere Informationen zur Barrierefreiheit (99)