ARCADE: A Real-Time Data System for Hybrid and Continuous Query Processing across Diverse Data Modalities

System architecture of ARCADE: query interface, query processing, and storage layers over a unified disk-based secondary index on LSM storage.

Abstract

The explosive growth of multimodal data spanning text, spatial, vector, and relational modalities, coupled with the need for real-time semantic search and retrieval, has outpaced the capabilities of existing systems, which either lack efficient ingestion and continuous query capabilities, or fall short in supporting expressive hybrid analytics. We introduce ARCADE, a real-time data system that efficiently supports high-throughput ingestion and expressive hybrid and continuous query processing across diverse data types. ARCADE introduces a unified disk-based secondary index framework on LSM-based storage for vector, spatial, and text data modalities, a comprehensive cost-based query optimizer that jointly leverages multiple heterogeneous indexes for hybrid queries, and an incremental materialized view framework for efficient continuous queries. ARCADE is built on open-source RocksDB and MySQL, with comprehensive system design and experiments presented in our ICDE 2026 research paper. Our demonstration showcases ARCADE through interactive real-world scenarios of social media marketing and equity research, exposing system-level metrics, internal query optimization decisions, and side-by-side baseline comparisons to provide attendees with deep insights into how ARCADE’s architectural choices facilitate real-time multimodal analytics.

Type
Publication
VLDB 2026 Demonstration