New Product Launch: Alibaba Cloud Data Integration

简介: Support online real-time & offline data exchange between all data sources, networks and locations with Alibaba Cloud Data Integration.

Big Data is the new corporate currency. If used correctly, there is immense value to be extracted. Revenues from Big Data services, software and hardware are predicted to reach USD $187BN in 2019, representing an increase of more than 50 percent over a five-year period.

Much of this data will pass through the cloud, with 50 percent of organizations predicted to embrace a cloud-first policy
in 2018 for Big Data and analytics. Enterprises are clearly demanding more flexibility and control over costs than on-premises solutions can deliver.

As the maturity of cloud-based technologies and the surge of Big Data converge, it is impossible to ignore the competitive edge a data processing and warehousing solution that is infinitely scalable and equally elastic brings to the enterprise. The tipping point for Big Data is here.

But why are we seeing a surge in cloud demand now? One major reason is the fact that technologies powering the cloud have not just increased in sophistication but concerns about security in the cloud have also diminished.

From complex, secured APIs to robust authentication and best practices, cloud platforms are investing in a range of features and support to ensure greater security and scalability. This strategy is paying off with the total number of organizations who distrust cloud dropping from 50 percent to 29 percent within just 12 months.

As a major cloud and big data infrastructure provider, Alibaba Cloud provides an expanding suite of cloud-based products to manage commercial big data problems, including Alibaba Cloud Data Integration, which has just recently been launched for the international market.

Data Integration is an all-in-one data synchronization platform that supports online real-time and offline data exchange between all data sources, networks, and locations. Based on an advanced distribution architecture with multiple modules (such as dirty data processing and flow control distributed system), the service provides data transmission, data conversion and synchronization services. It also supports multiple features, including support for multiple data sources, fast transmission, high reliability, scalability, and mass synchronization. Below we will take a closer look at the features and benefits of this new product and how your organization can add Data Integration to fulfill your Big Data processing needs.

Support for Multiple Disparate Data Sources

Data Integration supports data synchronization between more than 400 pairs of disparate data sources (including RDS databases, semi-structured storage, non-structured storage (such as audio, video, and images), NoSQL databases, and big data storage). This also includes important support for real-time data reading and writing between data sources such as Oracle, MySQL, and DataHub.

Scheduled Tasks

Data Integration allows you to schedule offline tasks by setting a specific trigger time (including year, month, day, hour, and minute). It only requires a few steps to configure periodical incremental data extraction. Data Integration works perfectly with DataWorks data modeling. The entire workflow is an integration of operations and maintenance.

Mass Upload to Cloud

Data Integration leverages the computing capability of Hadoop clusters to synchronize the HDFS data from clusters to MaxCompute, known as Mass Cloud Upload. Data Integration can transmit up to 5TB of data per day and the maximum transmission rate is 2GB/s.

Monitoring and Alarms

With 19 built-in monitoring rules, Data Integration applies to most monitoring scenarios. You can set alarm rules based on these monitoring rules. Additionally, you can pre-define the task failure notification mode for Data Integration.

Data Source Management

By leveraging the data sources and datasets that define the source and destination of data, Data Integration provides two data management plug-ins. The Reader plug-in is used to read data and the Writer plug-in is used to write data. Based on this framework, a set of simplified intermediate data transmission formats is developed to exchange data between arbitrary structured and semi-structured data sources.

Local Data Collection

Data Integration supports data synchronization in Alibaba Cloud classic networks and VPCs (virtual private cloud), as well as data collection in local IDCs.

Full Database Migration

Data Integration provides a full database migration tool which allows the creation of multiple data synchronization tasks and imports all data tables in a MySQL database to MaxCompute. By using full database migration, you no longer need to create synchronization tasks one at a time.

Incremental Synchronization

By using the WHERE clause, Data Integration supports business data filtering by date. Data with different dates is synchronized to the relevant MaxCompute partition tables. By setting the synchronization interval to 1 hour or 10 minutes, Data Integration is capable of performing quasi-real-time incremental synchronization.

To learn more about Data Integration, visit the product page at Alibaba Cloud today.

目录
相关文章
|
IDE Java Maven
Idea安装及项目设置配置和基本使用
Idea安装及项目设置配置和基本使用
1421 0
Idea安装及项目设置配置和基本使用
|
Java 编译器 Maven
Java“class file contains wrong class”解决
当Java程序运行时出现“class file contains wrong class”错误,通常是因为类文件与预期的类名不匹配。解决方法包括:1. 确保类名和文件名一致;2. 清理并重新编译项目;3. 检查包声明是否正确。
516 3
|
新零售 中间件 Java
阿里电商架构演变之路
首届阿里巴巴中间件技术峰会上,阿里巴巴中间件技术部专家唐三带来“阿里电商架构演变之路”的演讲,本文从阿里业务和技术架构开始引入,分别分享了阿里电商从1.0到4.0架构的演变之路,着重分析了分布式和异地多活的改变之路。
21323 124
|
算法 搜索推荐 Java
十大经典排序算法动画解析和 Java 代码实现【详细全代码】
排序算法可以分为内部排序和外部排序。内部排序是数据记录在内存中进行排序。而外部排序是因排序的数据很大,一次不能容纳全部的排序记录,在排序过程中需要访问外存。 常见的内部排序算法有:插入排序、希尔排序、选择排序、冒泡排序、归并排序、快速排序、堆排序、基数排序等。
292 0
十大经典排序算法动画解析和 Java 代码实现【详细全代码】
|
5天前
|
云安全 人工智能 运维
阿里云SecOps Agent,全新安全跨产品执行体验
自然语言驱动 云安全中心/WAF/CFW/ 等多款安全产品联动
1603 2
|
2天前
|
人工智能 定位技术 SEO
我学 GEO 第 15 天:终于知道AI GEO该如何做?
我是暴走的莉莉酱,边旅行边研究AI GEO的数字游民。专注普通人如何提升“AI可见度”——让AI在回答用户问题时准确识别、理解并推荐你。不讲玄学,只做可测、可调、可持续的GEO实践。
365 124
|
5天前
|
机器学习/深度学习 人工智能 调度
🐴 HappyHorse 1.1 现已上线阿里云百炼!快来查收模型使用指南,现在调用享 6 折~
HappyHorse 1.1 是新一代视频生成大模型,全面升级动态表现力、角色一致性、指令遵循、视觉质感与音画协同能力。支持I2V/T2V/R2V三类生成,适配短剧、电商广告、品牌营销等场景,提供高质、流畅、可控的AI视频生产力。
625 4
🐴 HappyHorse 1.1 现已上线阿里云百炼!快来查收模型使用指南,现在调用享 6 折~
|
2天前
|
缓存 人工智能 运维
阿里云618百炼大模型Qwen3.7-Max功能、免费试用、订阅计费、配置接入详解
Qwen3.7-MAX是阿里云百炼平台推出的通义千问3.7系列旗舰大语言模型,专为智能体时代复杂任务打造,依托阿里云全域算力与自研技术,在逻辑推理、长文本处理、代码工程、长周期自主执行等领域达到行业顶尖水平。2026年618期间,该模型推出多重免费试用权益、按量计费5折、订阅套餐优惠等专属福利,覆盖个人开发者、团队与企业全场景需求,以下从核心功能、免费试用、订阅计费、配置接入四方面展开详细解析。
364 123
|
16天前
|
缓存 测试技术 API
Qwen 3.7 Plus 与 Max 实测:性价比与多模态能力差异解析(2026)
2026 年 6 月 1 日,阿里悄无声息地发布了 Qwen 3.7 Plus,距 Qwen 3.7 Max 上线刚好 11 天。同样的 1M 上下文,同样的 35 小时自治上限。但价格才是头条:Plus 是 0.40/M输入,Max是 2.50/M——便宜约 6 倍——并且还能看图、看视频。Vision Arena 上 Plus 已经排到 #16。所以这周真正值得讨论的问题不是”要不要为视觉能力买单”,而是”Max 凭什么用 6 倍价格换来 2 个百分点的 benchmark 领先”。