1 Introduction
Scalability, flexibility, predictable costs, and massively reduced capital expenditures are just some of the reasons for many organizations to adopt a cloud-first strategy for their future business development. They motivate companies to migrate more and more of their applications, computing workloads, and data to public clouds. The ongoing COVID pandemic, which forced so many people to work from home for months, has only further increased worldwide cloud adoption.
Driven by the ongoing digital transformation, data is quickly becoming the most valuable asset for every organization, more important than oil or land. Regardless of whether data is the primary product for a business or just a tool to reach their customers, improve productivity, enable better planning, or a myriad of other reasons - the ability to collect, store and process vast amounts of information becomes a critical factor for business success in any industry or geography. Increasingly, a company's only option to achieve the necessary scale is to harness the power of the public cloud.
However, modern application architectures, deployment patterns, and technology stacks strongly affect the security landscape and introduce completely new risks to cloud customers. The continuously growing number and scale of data breaches indicate that even the largest organizations with fully staffed IT teams and large security budgets are failing to keep up with the cloud transformation. Unfortunately, many companies realize the challenges and vulnerabilities of dealing with sensitive data in the cloud too late, facing massive financial, legal, and reputational losses for their non-compliance.
The primary challenge for modern data-driven business is, of course, the overall scale and complexity of storing and managing data across heterogeneous (multi-cloud and hybrid) environments - data is everywhere, spread across a multitude of systems, models, and formats with incompatible technology stacks and security controls. Also, data never exists in a vacuum, isolated within just a single database, application, or network. It is constantly moving and transforming, passing through multiple systems, and being processed by numerous applications and services. Even the basic data governance at this scale becomes impossible without full visibility into data locations, classification, access policies, and so on.
The problem is further complicated by constantly changing governance and compliance requirements. After GDPR, new strict regulations for handling personally identifiable information are being introduced by numerous countries and state-level governments around the world, with massive fines for compliance violations. Some of the more recent legal decisions, such as the infamous Schrems II ruling, can completely change technical requirements for storing data in a cloud overnight, introducing new mandatory security controls for cloud projects. These quickly changing demands put a strain on both customers and vendors of cloud services.
Traditionally, data protection solutions have focused on securing individual infrastructure components - databases, file and object stores, networking, etc. - but this approach is no longer sustainable in multi-cloud environments. Another approach - data-centric security - is gaining popularity, focusing instead on data itself, providing full visibility into data at every stage of its active life cycle.
With Data Custodian, SAP offers a natively multi-cloud, fully managed SaaS solution for data transparency, protection, and compliance across the company's own business applications, as well as public hyperscalers and SAP-managed clouds.