备案控制台

开发者社区

开发者社区大数据文章正文

193 DStream相关操作 - Output Operations on DStreams

2023-11-01 11

版权

版权声明：

本文内容由阿里云实名注册用户自发贡献，版权归原作者所有，阿里云开发者社区不拥有其著作权，亦不承担相应法律责任。具体规则请查看《阿里云开发者社区用户服务协议》和《阿里云开发者社区知识产权保护指引》。如果您发现本社区中有涉嫌抄袭的内容，填写侵权投诉表单进行举报，一经查实，本社区将立刻删除涉嫌侵权内容。

简介： 193 DStream相关操作 - Output Operations on DStreams

Output Operations可以将DStream的数据输出到外部的数据库或文件系统，当某个Output Operations原语被调用时（与RDD的Action相同），streaming程序才会开始真正的计算过程。

Output Operation	Meaning
print()	Prints the first ten elements of every batch of data in a DStream on the driver node running the streaming application. This is useful for development and debugging.
saveAsTextFiles(prefix, [suffix])	Save this DStream’s contents as text files. The file name at each batch interval is generated based on prefix and suffix: “prefix-TIME_IN_MS[.suffix]”.
saveAsObjectFiles(prefix, [suffix])	Save this DStream’s contents as SequenceFiles of serialized Java objects. The file name at each batch interval is generated based on prefix and suffix: “prefix-TIME_IN_MS[.suffix]”.
saveAsHadoopFiles(prefix, [suffix])	Save this DStream’s contents as Hadoop files. The file name at each batch interval is generated based on prefix and suffix: “prefix-TIME_IN_MS[.suffix]”.
foreachRDD(func)	The most generic output operator that applies a function, func, to each RDD generated from the stream. This function should push the data in each RDD to an external system, such as saving the RDD to files, or writing it over the network to a database. Note that the function func is executed in the driver process running the streaming application, and will usually have RDD actions in it that will force the computation of the streaming RDDs.

文章标签：

分布式计算

流计算

Java

Hadoop

数据库

阿甘兄

目录

相关文章

爱喝汤的技术少年

|

2月前

|

分布式计算 JavaScript 前端开发

Stream学习笔记(二)map与reduce

Stream学习笔记(二)map与reduce

爱喝汤的技术少年

41 0 0

阿甘兄

|

5月前

|

机器学习/深度学习分布式计算 API

192 DStream相关操作 - Transformations on DStreams

192 DStream相关操作 - Transformations on DStreams

阿甘兄

15 0 0

q7s2kces74wvy

|

分布式计算算法大数据

Rdd 算子_转换_sample | 学习笔记

快速学习 Rdd 算子_转换_sample

q7s2kces74wvy

125 0 0

Rdd 算子_转换_sample | 学习笔记

bqospzg5rfs7g

|

消息中间件 SQL 分布式计算

Structured_Sink_Foreach | 学习笔记

快速学习 Structured_Sink_Foreach

bqospzg5rfs7g

99 0 0

Structured_Sink_Foreach | 学习笔记

星繁

|

存储测试技术索引

ES中数据流Data streams详解

ES中数据流Data streams详解

星繁

436 0 0

ES中数据流Data streams详解

秦超峰

|

分布式计算 Java 5G

spark异常：missing an output location for shuffle 0

spark异常：missing an output location for shuffle 0

秦超峰

381 0 0

游客wkxim4agoo6le

|

流计算

Structured Streaming之Event-Time的Window操作

笔记

游客wkxim4agoo6le

159 0 0

Structured Streaming之Event-Time的Window操作

小生凡一

|

分布式计算

RDD的 transformations 和 actions 总结

RDD的transformations和actions 两个RDD：一个RDD包含 {1, 2, 3} , 另一个RDD包含{3, 4, 5}

小生凡一

81 0 0

RDD的 transformations 和 actions 总结

eddie小英俊

|

SQL HIVE Java

Hive ERROR: Out of memory due to hash maps used in map-side aggregation

eddie小英俊

1119 0 0

科技小能手

|

算法关系型数据库 Oracle

11g direct path read介绍:10949 event、_small_table_threshold与_serial_direct_read

科技小能手

1223 0 0

热门文章

最新文章

Linux查看进程的内存占用情况

大咖云集，技术宅开趴倒计时 —— 2017 Kubernetes Meetup | 成都站

ubuntu安装KVM虚拟机管理virt-manager

OpenCV4之C++入门详解（上）

vue2中computed中无法获取到this

Visual Basic快速入门

HDU 4968 Improving the GPA

这56家公司共同发力智慧城市

书写高质量JavaScript代码的要义（The Essentials of Writing High Quality JavaScript）翻译

linux驱动开发--字符设备：信号量

【软件工程】融通未来的工艺：深度解析统一过程在软件开发中的角色

IBM SPSS Modeler分类决策树C5.0模型分析空气污染物数据

【软件工程】走进瀑布模型：传统软件开发的经典之路

数据分享|R语言用lme4多层次（混合效应）广义线性模型（GLM），逻辑回归分析教育留级调查数据（下）

【软件工程】走近演化过程模型：软件开发的不断进化之路

【Mybatis】深入学习MyBatis：概述、主要特性以及配置与映射

【MySQL】数据库规范化的三大法则 — 一探范式设计原则

r语言中对LASSO回归，Ridge岭回归和弹性网络Elastic Net模型实现（下）

【MySQL】数据库中为什么使用B+树不用B树

【MySQL】SQL优化

相关电子书

更多

A stream processing pipeline S

Custom applications with Spark’s RDD

WRITE GRAPH ALGORITHMS LIKE A

下一篇

部署LAMP环境（Alibaba Cloud Linux 3）