Apache Doris provides a new data type: Variant, for semi-structured data analysis, which enables 8 times faster query performance than JSON with 1/3 storage.
Semi-structured data is data arranged in flexible formats. Unlike structured data, it does not require data users to pre-define the table schema for it, so it provides convenience for data storage and analysis. Common forms of semi-structured data include XML, JSON, and log files. They are widely seen in the following industry scenarios: platforms store user reviews of products as semi-structured data for sentiment analysis and user behavior pattern mining.
test_variant; USE test_variant; CREATE TABLE IF NOT EXISTS github_events NULL, actor VARIANT NULL, repo VARIANT NULL, payload VARIANT NULL, public BOOLEAN NULL, created_at DATETIME NULL, INDEX idx_payload USING INVERTED PROPERTIES COMMENT 'inverted index for payload' ) DUPLICATE KEY DISTRIBUTED BY HASH BUCKETS 10 properties; Note: If the column has too many sub-columns, creating indexes on it may lead to an excessive number of index columns and decrease data writing performance.
tbl WHERE CAST MATCH 'hello world' The following are simple examples of queries on Variant columns Retrieve the Top 5 repositories with the most Stars from .
United Kingdom Latest News, United Kingdom Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
At Citizens Bank Park, new season, new bites: hoagies, new Schwarburger, ‘Always Sunny’ whiskey and morePhillies are bringing exciting new concessions to Citizens Bank Park, including hoagies and 'It's Always Sunny in Philadelphia' whiskey.
Read more »
Breaking Down Data Silos: How Apache Doris Streamlines Customer Data IntegrationLearn how Apache Doris breaks down data silos for insurance firms, streamlining customer data integration and boosting efficiency.
Read more »
Shift-Left Data Platforms in Early-Stage Startups: Strategies for Data-Driven SuccessLeft-Shift Data Platform: How to overcome early stage startup challenges to be Data-Driven
Read more »
Enterprise Data Technology Part 5 — Data Quality With AcumaticaAnalyst Robert Kramer continues his series on enterprise data technology with a detailed discussion of why data quality is so important, and how companies can achieve it.
Read more »
On-Chain Data Product Insights: The Data Analysis Revolution in the Web3 EraIn the rapidly evolving blockchain space today, on-chain data has become a core asset with an increasingly vital role in the ecosystem.
Read more »
Millions of customers' data found on dark web in latest AT&T data breachThe provider says it has reset the passcodes of the current account holders whose data was compromised as it investigates the leak, the latest in a string of telecommunications company data breaches.
Read more »