10 posts tagged with "Production Practice"

Introduction: This article describes the architecture upgrade and evolution of our MySQL-to-Hive data migration. The original architecture involved many components and complex links, and it ran into many challenges. Adopting the combination of StreamPark + Paimon effectively resolved the difficulties encountered in data integration. The article shares the specific practices of StreamPark + Paimon in production, along with the advantages and benefits this rookie combination brings.

StreamPark: https://github.com/apache/streampark

Paimon: https://github.com/apache/paimon

You are welcome to follow, star, fork, and contribute.

Contributor|Beijing Ziru Information Technology Co., Ltd.

Authors of the article|Liu Tao, Liang Yansheng, Wei Linzi

Article compilation|Yang Linwei

Content proofreading|Pan Yuepeng

Abstract: This article is compiled from a talk given by Mu Chunjin, head of the real-time computing team at China Union Data Science and an Apache StreamPark Committer, at the platform construction session of Flink Forward Asia 2022. The article is divided into four parts:

  • Introduction to the Real-Time Computing Platform Background
  • Operational Challenges of Flink Real-Time Jobs
  • Integrated Management Based on StreamPark
  • Future Planning and Evolution

Foreword: This article introduces how Bondex, a supply chain logistics service provider, implemented a streaming data warehouse with Paimon + StreamPark during its digital transformation. We provide an easy-to-follow operational manual based on the Apache StreamPark integrated stream-batch platform to help users submit Flink tasks and quickly master the use of Paimon.

  • Introduction to Company Business
  • Pain Points and Selection in Big Data Technology
  • Production Practice
  • Troubleshooting Analysis
  • Future Planning

Preface: This article discusses the challenges Shunwang Technology encountered when using the Flink computation engine, and how it uses StreamPark as a real-time data platform to address them and support the company's business at scale.

  • Introduction to the company's business
  • Challenges encountered
  • Why choose StreamPark
  • Implementation in practice
  • Benefits brought
  • Future planning

Abstract: This article comes from the production practice of StreamPark at Dustess Information and was written by Gump, a senior data development engineer. The main content includes:

  1. Technology selection
  2. Practical implementation
  3. Business support & capability opening
  4. Future planning
  5. Closing remarks

Dustess Information is a one-stop private domain operation management solution provider based on the WeChat Work ecosystem. It is committed to becoming the leading expert in private domain operation and management across all industries, helping enterprises build a new model of private domain operation management in the digital age, and promoting high-quality development for businesses.

Currently, Dustess has established 13 city centers nationwide, covering five major regions: North China, Central China, East China, South China, and Southwest China, providing digital marketing services to over 10,000 enterprises across more than 30 industries.