Powered By
Here is a list of organizations using Scio in production.
Organization | Use Case | Code |
---|---|---|
Spotify | Everything including music recommendation, monetization, artist insights and business analysis. We also use BigQuery, Bigtable and Datastore heavily with Scio. | Big Data Rosetta Code, Ratatool, Featran |
Big Viking Games | Streaming event collection and ETL using Pub/Sub and BigQuery | |
Algolia | Log collection and analytics using Bigtable, Cloud Storage & Pub/sub | |
Hypefactors | Natural language processing / media monitoring. Also using PubSub, GCS and ElasticSearch with Scio. | |
Discord | Streaming event collection, sessionization, and enrichment using Pub/Sub, BigQuery, and Bigtable. | |
Dow Jones | Streaming article events, bulk article extractions and ETL using Pub/Sub, GCS and BigQuery for the DNA Platform. | |
Honey | Streaming ETL data pipeline from Pub/Sub to Pub/Sub, BigTable, BigQuery. | |
Cabify | Streaming data pipelines from Pub/Sub to BigQuery | |
Jobrapido | Streaming and Batch ETL using Pub/Sub and BigQuery | |
9GAG | Streaming and Batch ETL using Pub/Sub | |
Cityblock | Streaming and Batch ETL using Pub/Sub, BigQuery, and Datastore | |
Arquivei | Streaming and Batch ETL using Pub/Sub, BigQuery, GCS, S3, and ElasticSearch | |
Vpon | Batch ETL and BigQuery | |
Snowplow Analytics | Streaming ETL | Beam Enrich, BigQuery Loader and Cloud Storage Loader |
Quibi | Streaming and Batch ETL from Pub/Sub and GCS to BigQuery | |
Ztore | Streaming ETL and event-driven application, using Pub/Sub, BigQuery, Kafka, MongoDB, AWS SQS, and DynamoDB |
0.12.3+0-0b1102d7+20230130-1503*