Streams and Tables:
Two Sides of the Same Coin

Blank
Shadow
Streams and Tables: Two Sides of the Same Coin

Download the Paper

Stream processing has emerged as a paradigm for applications that require low-latency evaluation of operators over unbounded sequences of data. Defining the semantics of stream processing is challenging in the presence of distributed data sources because the physical and logical order of data in a stream may become inconsistent in such a setting.

In this paper, we introduce the Dual Streaming Model. The model presents the result of an operator as a stream of successive updates, which induces a duality of results and streams. As such, it also addresses the inconsistencies between the physical and logical order of streaming data in a continuous manner, without explicit buffering and reordering.

We further discuss the trade-offs and challenges faced when implementing this model in terms of correctness, latency and processing cost. A case study based on Apache Kafka illustrates the effectiveness of the model based on real-world requirements.

サイトのご利用状況を把握しユーザーエクスペリエンスの改善へとつなげるため、当サイトでは Cookie (クッキー) を使用しています。Cookie についての詳細を確認するには、また Cookie 設定の変更をご希望の場合は、こちらをクリックしてください。閲覧を続行することにより、当社の Cookie 使用に同意されたものとみなされます。