精品欧美一区二区三区在线观看 _久久久久国色av免费观看性色_国产精品久久在线观看_亚洲第一综合网站_91精品又粗又猛又爽_小泽玛利亚一区二区免费_91亚洲精品国偷拍自产在线观看 _久久精品视频在线播放_美女精品久久久_欧美日韩国产成人在线

Cloud-Native Data Lakes: How Can They Benefit Enterprises in the Era of Big Data?

原創 精選
Techplur
In the age of big data and artificial intelligence, data lakes are expected to become a key platform for the convergence of storage, computing, and analytics.

Data lakes have gained popularity among the general public over the past few years. Despite the lack of consensus regarding the definition, global tech giants such as Amazon, Alibaba, Tencent, and Huawei have developed plans to construct their own.

In the age of big data and artificial intelligence, data lakes are expected to become a key platform for the convergence of storage, computing, and analytics, and this trend is even more evident when data lakes are complemented by cloud-native technologies.


Data lakes are on the rise

In 2010, James Dixon, founder and CTO of Pentaho, introduced the concept of data lakes, which are analogous to raw water since they contain unprocessed data that retains its original structure.

Various types of users can access the lake to obtain, distill, and purify data (water) flowing from multiple sources. Therefore, the data lake is typically characterized as a centralized system for storing unstructured, semi-structured, and binary data in its original format, which can store structured, semi-structured, and unstructured data.

In the convergence and development of big data, the boundaries of data lakes are expanding. This develops into a comprehensive big data solution with unified storage, multi-paradigm computation and analysis of multi-source heterogeneous data, and unified management and invocation.

In this regard, data lakes differ significantly from data warehouses.

A data warehouse is a solution designed to convert data into a particular format and then replicate it to another library for columnar storage at regular intervals to meet enterprise querying and data analysis needs.

Business data used to be primarily ERP and CRM data, which can often be terabytes in size. Therefore, enterprises typically use data warehouse solutions locally to store and analyze their data. Data warehouses are fixed paradigms, and the data underlying the paradigm cannot be changed.

Internet development has resulted in an explosion of data, especially a rise in unstructured ones, and accelerated changes in enterprise systems. Digital transformation, now a hot topic in the IT industry, calls for a deeper understanding of data. Thus, it is imperative to retain the original information contained in the data to meet the changing needs of the future.

With the advent of big data, traditional data warehouses can no longer meet enterprises' demands for real-time and interactive analysis, but data lakes have adopted the design principle of "loose in, tight out", eliminating the strict model during the initialization phase and placing the "schme" later to achieve further flexibility, while ensuring data consistency and performance through unified storage and computation optimization. As such, big data has gradually gained attention towards the data lake model.

The concept of the data lake is no longer restricted to a specific technology or software product but covers a wide range of applications such as storage, computing, and artificial intelligence to meet the needs of enterprise-level users in terms of production management.


Cloud-native and data lakes: Why do they make great partners?

With the rapid evolution of enterprise business, database middleware such as Oracle has become increasingly difficult to adapt to the changing needs of data processing.

For this reason, the IT industry continuously produces new computing engines.

Several enterprises have developed their own open-source Hadoop data lake architectures, where the original data is stored on HDFS uniformly, and the engine is based on the Hadoop and Spark open-source ecosystems, enabling storage and computing to be converged as one.

This architecture, however, has some disadvantages for enterprises because they must operate and manage the whole cluster independently, which is costly and results in poor stability.

In this case, a cloud-hosted Hadoop data lake was created (i.e., the EMR open-source data lake). Cloud vendors provide and manage the underlying physical servers and open-source software versions, and the data is still stored uniformly in HDFS on an engine based on the Hadoop and Spark open-source ecosystems. With this architecture, enterprises can enhance machine resilience and stability using cloud-based IaaS, thereby reducing their overall operational costs; however, enterprises are still responsible for the operations of applications such as managing and governing the HDFS system and services.

As storage and computing are coupled, the stability is not optimal, the resources cannot be scaled independently, and the cost of use is still high due to the close coupling. Meanwhile, due to the internal limitations of open-source software, traditional data lakes cannot meet enterprises' needs regarding data scale, storage costs, query performance, and flexible computing architectures. In other words, the data lake architecture is not yet ideal.

By utilizing cloud computing, the data lake can be maximized and played to its full potential. Cloud computing has highly flexible, resilient, and scalable computing and storage resources, making storing, analyzing, and applying data incredibly easy.

Moreover, the most outstanding value of the data lake lies in the unification of various data formats within the enterprise and the capability to analyze data in multiple ways on top of one piece of data with cost-effective and efficient mining. Since 2010, when the idea of the data lake was first proposed, cloud service providers have played an essential role in its implementation.

In the cloud-native age, we deploy data lakes in a cloud-native manner. When people hear the term cloud-native, they immediately think of serverless, containerization, etc. However, in recent years, the term has been extended to cover a wide range of products and services.

Essentially, cloud-native is a paradigm for designing distributed systems with resilience, security, stability, and other advantages that can be maximized to enhance performance.

The data lake can benefit from the performance enhancement that the cloud provides. An advantage of cloud computing is its high availability. Compared with on-premise IDC, cloud computing offers more redundancy of resources and can seamlessly switch to other nodes in the event of a failure to ensure business continuity.

Meanwhile, it exhibits resilience. Due to its scalability and affordability, cloud computing can solve the problems associated with massive business volumes and handle the enormous scale of resources and the emergent nature of big data analytics.

The final factor is agility. By eliminating repetitive and complex IT work, the cloud enables enterprises to iterate, deploy, operate, and innovate quickly.

Furthermore, data lakes can optimize performance more effectively in a cloud-native environment through features such as analytics acceleration from a rich context, real-time data value mining from the convergence of stream and batch processing, as well as security and quality improvements with a one-stop solution for data management.

Enterprises can effectively utilize the public cloud infrastructure, and data lake platforms now have a greater range of technology options, including pure hosted storage on the cloud, which can gradually replace HDFS as the storage infrastructure, and the engine richness continues to improve. By leveraging the cloud's unique characteristics of "pooling, resilience, and agility", many data and application layers can be realized, and cloud-native becomes a natural choice for data lakes and even big data.


The future of cloud-native data lakes

Essentially, cloud-native data lakes are new technical products developed by big data computing platforms with the help of cloud computing theory, which supports flexible heterogeneous data storage and resilient scaling of computing resources and helps enterprises cope with the current business requirements of more and more complex data structures and data processing timeliness.

Therefore, cloud-native data lakes are only an architectural principle, and there are a number of ways to implement them, including EMR and Flink solutions.

Although data lake technology is developing rapidly in China with more public cloud vendors making innovations, the implementation of data lakes still faces many difficulties.

There are currently barriers and difficulties in the data-aware collection, categorization, cleaning, and lack of experience in data lake modeling. In general, the overall development of China's data lake market is at an early stage, with inconsistent roadmaps and chaotic product capabilities in the industry.

At the product level, the data governance capability and total link strength of the data lake still need to be further bolstered.  

Data governance requires the inclusion of data classification and rules in the directory. Suppose an enterprise's control over the data lake is insufficient. In that case, it will lead to the poor design of the data lake directory and overall architecture. The data in a lake will not be adequately archived or maintained, making a data lake a data swamp. Due to the lack of contextual metadata association, the data swamp cannot be retrieved, resulting in users being unable to analyze and utilize the data effectively.

Chinese domestic vendors that provide total–link cloud-native data lake services are currently insufficient, and most only support data lake components. The downstream companies are therefore limited to relying on multiple vendors for data collection, governance, analysis, and visualization.

Furthermore, application-level training and industry awareness are lacking for cloud-native data lakes. Professional employees are in high demand by enterprises as the big data and artificial intelligence technology stack continue to evolve. Sometimes, managers have little knowledge of data governance and blindly build a data lake without thoroughly analyzing the current situation, leading to poor implementation of business. Despite the widespread recognition of the value of data, the data lake has faced many challenges in promoting and raising awareness, as many enterprises remain cautious and wait and see.

Aside from this, as enterprises move toward digital transformation, data has become one of the most critical production factors, and one of the biggest risks is security, particularly access control. There is a large amount of data entering the lake without any regulation. Once specific data contains privacy and regulatory requirements that other data does not, data leakage and loss will likely occur, resulting in incalculable consequences.

A new industry faces numerous challenges during its early stages, but imperfections are precisely what enable a business to grow. As the "China Cloud-Native Data Lake Application White Paper" of ??iResearch ??shows, the maturity of the big data industry in China was boosted by favorable national policies, such as the "Action Plan for Promoting the Development of Big Data", the "Implementation Plan for National Big Data Center and Collaborative Innovation System of Computing Hub" and other documents related to the advancement of Internet technology and digital transformation.

As China's market for cloud-native data lakes is expected to grow at a 39.7% CAGR over the next five years, it is vital for us to keep an eye on the development in the near future.


Reference:

??https://www.iresearch.com.cn/Detail/report?id=3972&isfree=0??

責任編輯:龐桂玉 來源: 51CTO
相關推薦

2022-08-30 20:45:41

cloudcloud natieducation

2011-08-10 16:45:55

Big Data

2012-05-28 13:58:36

Hadoop大數據

2022-08-31 16:15:56

AIOpsCloud Nati

2022-08-31 11:39:44

big datablockchainOKLink

2012-10-18 10:15:01

IBMdw

2011-08-18 14:23:52

Big Data

2022-08-31 16:13:11

cloud nati

2012-02-20 09:27:00

IBM大數據Big Data

2013-05-21 10:05:55

倫敦奧運Big Data奧運大數據

2020-06-02 09:28:46

大數據物聯網IOT

2011-10-28 08:47:39

IBMBig Datn數據分析

2013-01-07 09:40:28

谷歌大數據Android

2012-05-30 13:44:45

大數據Etu知意圖

2015-05-15 10:12:35

NETGEAR云數據中心

2016-02-16 14:42:58

戴爾云計算

2012-05-31 10:14:23

大數據

2013-05-23 09:34:49

Big Data大數據

2017-03-09 13:17:27

大數據

2022-06-09 11:47:21

工具數據儀連接器
點贊
收藏

51CTO技術棧公眾號

国产精品一二三在线观看| 国产精品三级美女白浆呻吟| 性猛交╳xxx乱大交| 免费成人在线电影| 欧美国产禁国产网站cc| 亚洲va欧美va国产综合剧情| 成年人免费看毛片| 日韩成人激情| 日韩风俗一区 二区| 污污动漫在线观看| av免费不卡| 国产欧美日韩精品一区| 91精品国产一区二区三区动漫| 中日韩精品视频在线观看| 日韩成人三级| 亚洲精品国产精品国产自| 国产一伦一伦一伦| 色戒汤唯在线观看| 一区二区在线电影| 欧美综合77777色婷婷| 精品人妻一区二区三区四区不卡 | 懂色aⅴ精品一区二区三区| 亚洲欧美日韩人成在线播放| 欧美日韩综合精品| 亚洲精品97久久中文字幕| 蜜臀av一区二区| 欧美亚洲一级片| 校园春色 亚洲| 日韩欧美不卡| 亚洲人成网站免费播放| 国产吃瓜黑料一区二区| 四虎国产精品免费久久| 欧美午夜无遮挡| 隔壁人妻偷人bd中字| 欧美精品电影| 中文字幕精品一区二区精品绿巨人 | 色综合久综合久久综合久鬼88 | 好吊色一区二区| 激情综合网天天干| 国产精品成人国产乱一区| 亚洲国产成人精品激情在线| 欧美福利影院| 欧美插天视频在线播放| 亚洲一二三四五六区| 国产一区不卡| 亚洲欧美中文日韩在线v日本| 亚洲天堂av网站| 亚洲日本va| 日韩一级片在线播放| 天天综合天天添夜夜添狠狠添| 色成人免费网站| 在线观看亚洲专区| 人人爽人人av| 青青在线精品| 91麻豆精品国产91久久久久久久久| 亚洲黄色小视频在线观看| 欧美精选视频一区二区| 欧美主播一区二区三区| 无码人妻精品一区二区三区66| 刘亦菲一区二区三区免费看| 日韩欧中文字幕| 国产成人精品视频ⅴa片软件竹菊| xxx欧美xxx| 在线观看日韩电影| 伊人影院综合在线| 国产999精品在线观看| 在线播放中文一区| 久久黄色一级视频| 国产在线播放精品| 亚洲色图狂野欧美| 18啪啪污污免费网站| 天天做天天爱综合| 欧美刺激性大交免费视频| 国产在线一区视频| 亚洲欧美日韩在线观看a三区 | 国产精品xxxxxx| 麻豆91在线观看| 亚洲综合大片69999| 亚洲高清精品视频| www久久久久| 涩涩涩999| av大全在线| 岛国av一区二区在线在线观看| 男人舔女人下面高潮视频| 69堂精品视频在线播放| 91精品国产一区二区三区蜜臀 | 久久99精品久久久久久园产越南| 一个色综合导航| 唐朝av高清盛宴| 美女久久网站| 亚洲一区精品电影| 日韩av资源站| 最新成人av在线| 国产精品秘入口18禁麻豆免会员| 九色成人搞黄网站| 日韩一区二区三区av| 色婷婷av777| 久久久人成影片免费观看| 久久久噜噜噜久久中文字免| 最近中文在线观看| 成人国产在线观看| 亚洲精品中文字幕在线 | 国产成人精品久久| a在线观看视频| 国产视频亚洲色图| 国产夫妻自拍一区| 久久影院午夜精品| 91精品福利在线一区二区三区| 欧美色图亚洲激情 | 欧美精品99久久| 91国产精品| 亚洲四色影视在线观看| 久青草免费视频| 国内成人自拍视频| 日本高清不卡一区二区三| 欧美女同一区| 在线播放欧美女士性生活| 欧美成人国产精品一区二区| 亚洲成人直播| 亚洲一区中文字幕| 在线观看免费黄色| 一本久久a久久免费精品不卡| 中文字幕无码毛片免费看| 日韩精品看片| 国产97色在线|日韩| 网站黄在线观看| 亚洲一区在线观看视频| 亚洲制服中文字幕| 欧美少妇性xxxx| 日韩免费黄色av| 欧美理论在线观看| 午夜视频一区二区三区| 色欲无码人妻久久精品| 99久久久久国产精品| 国产精品久久国产精品99gif| 午夜影院在线视频| 亚洲线精品一区二区三区八戒| 香蕉视频色在线观看| 国产精品7m凸凹视频分类| 国产精品极品美女在线观看免费 | 僵尸世界大战2 在线播放| 国产精品国产亚洲精品| 日韩亚洲国产中文字幕| 在线免费观看一区二区| 国产精品免费aⅴ片在线观看| 少妇高清精品毛片在线视频| 亚洲色图丝袜| 欧美有码在线观看视频| 日色在线视频| 色婷婷国产精品| 在线免费观看麻豆| 日韩影院免费视频| 午夜久久资源| 婷婷激情成人| 久久亚洲精品一区二区| 国产99视频在线| 亚洲夂夂婷婷色拍ww47| 东京热av一区| 国产精品一区亚洲| 日韩精品不卡| 91成人小视频| 欧美大片免费观看在线观看网站推荐| 亚洲精品97久久中文字幕无码| 亚洲高清免费观看高清完整版在线观看| 欧美丰满熟妇bbb久久久| 在线免费高清一区二区三区| 免费在线成人av| 国产韩日精品| 99久久精品免费| 伊人青青综合网站| 中文字幕av资源| 亚洲品质自拍视频网站| 91丨porny丨九色| 国产一级久久| 亚洲国产欧美不卡在线观看 | 9.1成人看片免费版| 天堂久久一区二区三区| 亚洲一区三区电影在线观看| 榴莲视频成人app| 国内精品国产三级国产在线专| 亚洲av成人精品毛片| 欧美三级午夜理伦三级中视频| 欧美日韩色视频| www.日韩av| 天天色综合社区| 国产综合欧美| 日韩欧美精品在线不卡| 电影中文字幕一区二区| 欧美精品xxx| 高清毛片在线看| 日韩亚洲欧美中文三级| 国产精品久免费的黄网站| 亚洲国产激情av| 亚洲日本久久久| 奇米四色…亚洲| 超碰成人免费在线| 日本一二区不卡| 精品国产乱码久久久久久108| 国内自拍亚洲| 91国内在线视频| 理论片午午伦夜理片在线播放| 精品久久久久久久久久久院品网 | 成人直播视频| 按摩亚洲人久久| 欧美捆绑视频| 亚洲成人av在线播放| 在线观看毛片av| 精品高清美女精品国产区| 波兰性xxxxx极品hd| 26uuuu精品一区二区| 在线播放av网址| 国内一区二区视频| www.欧美日本| 亚洲在线电影| 美女扒开大腿让男人桶| 91九色精品国产一区二区| 欧美日韩一区综合| 黄色免费大全亚洲| 97人人干人人| 伊人久久综合网另类网站| 国产精品7m视频| 国产高清不卡| 欧美亚洲成人精品| 成人高潮aa毛片免费| 欧美另类极品videosbestfree| www.中文字幕久久久| 国产视频在线观看一区二区| 日本高清视频免费看| 日韩精品专区在线| 国产毛片久久久久| 欧美日韩国产一级片| 色老头一区二区| 日韩欧美一区二区三区久久| 日韩男人的天堂| 亚洲国产wwwccc36天堂| 妺妺窝人体色www聚色窝仙踪| 亚洲色图视频网站| 亚洲视频重口味| 中文字幕一区二区三区四区 | 国产欧美日韩综合一区在线播放 | 午夜精品美女久久久久av福利| 亚洲人和日本人hd| 欧美一区三区二区在线观看| 亚洲+变态+欧美+另类+精品| 久久99九九| 综合亚洲色图| 欧美三级网色| 日韩av专区| 特级毛片在线免费观看| 91亚洲国产| 400部精品国偷自产在线观看| 99久久精品网| 日本精品福利视频| 黑丝一区二区三区| 国产96在线 | 亚洲| 亚洲自啪免费| 国产又大又黄又粗的视频| 男女性色大片免费观看一区二区| 污污的网站18| 国内精品视频666| 四虎精品一区二区| 久久综合久久综合久久综合| 微拍福利一区二区| 国产精品久久久久久久久久免费看| 日韩福利小视频| 亚洲一区二区三区在线看| 日韩av在线天堂| 色乱码一区二区三区88| 在线观看视频中文字幕| 欧美一区二区三区在线| 午夜精品在线播放| 日韩电影大片中文字幕| 国产三级在线| 欧美插天视频在线播放| h片在线观看视频免费免费| 清纯唯美亚洲激情| 日本国产亚洲| 国产伦精品一区二区三区视频黑人 | 久久电影国产免费久久电影| 97人人模人人爽人人澡| 99精品久久只有精品| 91麻豆制片厂| 一区二区成人在线| 中文字幕一区二区三区精品| 精品视频一区三区九区| 性生活黄色大片| 亚洲毛片在线免费观看| 久做在线视频免费观看| 91sao在线观看国产| 四虎地址8848精品| 久久99精品国产99久久| 婷婷伊人综合| 男人靠女人免费视频网站| 久久99日本精品| www.免费av| 亚洲欧美一区二区三区国产精品| 少妇一级淫片免费放中国| 欧美老肥妇做.爰bbww| 亚洲av电影一区| 色综合久久悠悠| 久久三级毛片| 欧美精品中文字幕一区二区| 欧美1区免费| 麻豆一区二区三区视频| va亚洲va日韩不卡在线观看| 九九热视频在线免费观看| 精品久久中文字幕久久av| 999av视频| 中文字幕日韩欧美精品在线观看| av福利导福航大全在线| 国产欧美va欧美va香蕉在线| 亚洲精品国产动漫| 日韩在线观看a| 九色综合国产一区二区三区| 麻豆精品免费视频| 亚洲成人精品影院| 国产夫妻在线观看| 日韩在线中文字幕| 欧美一级大片| 美女视频久久| 亚洲福利专区| 日本道中文字幕| 一区二区不卡在线播放 | 欧州一区二区三区| 亚洲国产午夜伦理片大全在线观看网站| 99精品视频免费| 尤物网站在线观看| 亚洲一区二区美女| 精品人妻一区二区三区蜜桃| 久久高清视频免费| 婷婷精品久久久久久久久久不卡| 四虎影院一区二区三区| 久久久久免费| 美女爆乳18禁www久久久久久 | 免费看av成人| 日本日本19xxxⅹhd乱影响| 成人va在线观看| 国产网址在线观看| 亚洲成人精品视频| 国产第一页在线| 国产高清一区视频| 在线观看视频免费一区二区三区| 亚洲熟女一区二区三区| 亚洲自拍偷拍av| 黄色av网站免费在线观看| 欧美激情欧美狂野欧美精品| 伊人久久大香线蕉av超碰| 成人av在线播放观看| 国产成人啪免费观看软件| 一区视频免费观看| 欧美大肚乱孕交hd孕妇| 男女在线观看视频| 国产精品一区二区三区免费观看| 1024日韩| 野花社区视频在线观看| 粉嫩av一区二区三区免费野| 欧美日韩在线精品一区二区三区激情综 | 日韩无一区二区| 99热99re6国产在线播放| 精品日韩美女| 日本午夜精品一区二区三区电影| 中国1级黄色片| 日韩一区二区三区免费看 | 色婷婷精品久久二区二区蜜臀av | 亚洲综合图色| 国产又黄又猛又粗| 亚洲另类色综合网站| 黄色av小说在线观看| 欧美一区亚洲一区| 日韩理论电影院| 男人添女人荫蒂国产| 精品人伦一区二区三区蜜桃网站 | 欧美激情一区二区三区| 亚洲在线免费观看视频| 久热精品在线视频| 成人另类视频| 凹凸日日摸日日碰夜夜爽1| 亚洲国产精品av| 刘亦菲毛片一区二区三区| 日本一区二区在线播放| 天天影视天天精品| 国产伦精品一区三区精东| 91福利区一区二区三区| 国产原创精品视频| 精品麻豆av| 免费高清成人在线| 欧美日韩一级在线观看| 亚洲人成电影网站色xx| 国产精品国产亚洲精品| 欧美国产激情视频| 自拍偷拍欧美激情| 亚洲欧美日本在线观看| 91久久国产精品| 性欧美暴力猛交另类hd| 精品人妻伦九区久久aaa片| 国产婷婷成人久久av免费高清 | 91精品在线观看入口| 中文在线最新版地址| 亚洲一区 在线播放| 国产亚洲一二三区|