大数据 - 【Event Registration】August 13 Hangzhou Station - Open Source Meets Big Data - 亚马逊云开发者

2022/8/13 Data Everywhere Series Events-Hangzhou Station

Open source meets big data

Data is ubiquitous, and under the wave of big data, wave after wave of open source projects born for data have begun to rise. We can not only see the acceleration effect of open source on data application, but also try to use data to analyze open source, open source and What kind of sparks will emerge from the encounter of big data? In this issue of sharing, we will hold four different lectures around big data and open source projects, from distributed file systems to HTAP databases, to insight analysis of open source data and the evolution of message queues. Enterprises and individuals related to open source big data bring enough benefits.

Event Information and Registration

Event time: 13:00-17:10 pm on August 13

Venue: Herun House, 9th Floor, Building 2, No. 59, Xiba Road, Yuhang District, Hangzhou (diagonally opposite to Shuzhi Engine)

Registration method: https://mini.awsapp.cn/l/ptjvfP5O0FJj

Reminder: According to Hangzhou's epidemic prevention policy, a negative nucleic acid test certificate within seven days is required, and the latest epidemic prevention policy shall prevail.

schedule

13:00-13:30 Check-in

13:30-13:40 Opening Introduction and Amazon Cloud Technology Community Introduction

13:45-14:30 Gao Changjian "Analysis of Metadata Design of Large-Scale Distributed File System"

14:30-15:15 Li Hao "What is the real HTAP database?"

15:15-15:45 Coffee Break

15:45-16:15 Zhao Shengyu, "Big Data in Open Source and Open Source Data Solutions"

16:15-17:00 Shen Yuhao "The Evolution of Message Queuing in the Cloud Native Era"

17:00-17:10 Interactive & Lucky Draw & Closing & Group Photo

Gao Changjian Juicedata Technical Expert

Sharing topic: Analysis of metadata design of large-scale distributed file system

Instructors:
Participated in the construction of the main team of the JuiceFS open source community. Ten years of experience in the Internet industry. He has served as an architect in Zhihu, Immediate, and Xiaohongshu, focusing on technical research in the fields of distributed systems, big data, and Al.

Share content:
1. What is a distributed file system
2 Introduction to the architecture of the industry's large-scale distributed file system
3. What is the metadata of the file system
4. How to design the metadata of the file system
5. Follow-up Outlook

Listener Benefit:
1. Understand the concept of distributed file system
2. Understand the architecture design of large-scale distributed file systems in the industry
3. Understand how to design the metadata of a file system

Li Hao StoneDB Chief Architect

Share topic: What is a real HTAP database?

Instructors:
Chief Architect of StoneDB, worked in Huawei, iQiyi, and Founder of Peking University to design the core architecture of the database kernel. More than 10 years of experience in database kernel development, good at query engine, execution engine, large-scale parallel processing and other technologies. Possess dozens of database invention patents, and author of "PostgreSOL Query Engine Source Code Technology Analysis"

Share content:
1. What is an HTAP database
2. What is the background of HTAP
3. What capabilities should a real HTAP have
4. Practical experience of HTAP
5. Thoughts on open source databases

Listener Benefit:
1. Understand what TP database, AP database, and HTAP database are, and their usage scenarios
2. Understand what capabilities a real HTAP database should have
3. Learn from experience in HTAP database practice

Zhao Shengyu, Director of Open Source Society, PhD student in Computer Science, Tongji University

Sharing topic: Open Source Data Solutions_ Big Data in Open Source

Instructors:

2022 Open Source Society Director
PhD student at Tongji University X-lab
Mainly doing theoretical research and data analysis related to open source. Former member of Alibaba Open Source Office

Speech content:
1. What is open data in open source?
2. Introduction to open source community measurement methods in international industries
3. Open source data analysis and insight under graph algorithms
4. Visualization in Open Source Data

Listener benefits:
1. Understand big data in an open source world
2. Understand the current mainstream data measurement methods
3. Understand the open source world from a network perspective

Shen Yuhao StreamNative Product Manager

Sharing topic: Evolution of message queues in the cloud-native era

Instructors:
Currently working as a product manager at StreamNative, mainly responsible for product management related to private cloud and PulsarOperator. He has worked on customer success and product management in Microsoft, Qiniu, and PingCAP, focusing on distributed systems, cloud native, and big data.

Speech content:
1. The evolution of message queues
2. Typical characteristics of cloud-native message queues
3. Introduction to Apache Pulsar

Hierarchical Sharding Architecture
Unified messaging model and protocol - built-in enterprise-grade features

4. Apache Pulsar application scenarios

financial transaction scene
Multi-active scenarios across regions
Batch stream fusion real-time data warehouse
IoT Scenario

Listener Benefit:
1. Understand some common features of message queues and some features of mainstream message queues
2. Understand the basic concepts and architectural features of Apache Pulsar
3. Understand the application scenarios that Apache Pulsar is suitable for

Activity Welfare

Benefit one:
In offline activities, in addition to sharing content full of dry goods, there must be a delicate and delicious coffee break!

Benefit two:
Sign in at the event site to receive the exquisite custom peripherals from the Data Everywhere series!

Benefit three:
During the on-site question and answer session, you will also have the opportunity to obtain other exquisite peripherals carefully prepared for you by User Group and the cooperative community~

【Event Registration】August 13 Hangzhou Station - Open Source Meets Big Data

Open source meets big data

Event Information and Registration

schedule

Activity Welfare

亚马逊云开发者

引用和评论

基于 Agentic AI+Redshift MCP Server 实现 Agentic Data Analysis

基于 MCP 的 AI Agent 应用开发实践

OSPO Summit 2025 正式定档！议题征集同步开启

OSPO Summit 2025 首批议程发布！

Dolphinscheduler IDEA本地调试

【Hadoop】HDFS架构解析

【Hadoop】HBase系统解析及适用场景