Описание тега spark-structured-streaming
Spark Structured Streaming allows processing live data streams using DataFrame and Dataset APIs.
Spark Structured Streaming provides fast, scalable, fault-tolerant, end-to-end exactly-once stream processing with the Dataset/DataFrame APIs available in Python, R (in both sparkr and sparklyr) Scala and Java. Structured streaming is for Spark 2.x and is not to be confused with Spark Streaming which is for Spark 1.x.
External resources:
- The official Structured Streaming Programming Guide
- Structured Streaming In Apache Spark A new high-level API for streaming
See also: