2022/8/13 Data Everywhere Series Event: Hangzhou Station
Open source meets big data
Data is ubiquitous, and the big data wave has given rise to one open source data project after another. Open source is accelerating data applications, and data can in turn be used to analyze open source itself. What sparks will fly when open source meets big data? This session features four talks on big data and open source projects, covering distributed file systems, HTAP databases, insight analysis of open source data, and the evolution of message queues, with plenty of takeaways for companies and individuals working with open source big data.
Event Information and Registration
Event time: 13:00-17:10 on August 13
Venue: Herun House, 9th Floor, Building 2, No. 59, Xiba Road, Yuhang District, Hangzhou (diagonally opposite to Shuzhi Engine)
Registration method: https://mini.awsapp.cn/l/ptjvfP5O0FJj
Reminder: Under Hangzhou's epidemic prevention policy, proof of a negative nucleic acid test taken within the past seven days is required. If the policy changes, the latest version applies.
Schedule
13:00-13:30 Check-in
13:30-13:40 Opening Introduction and Amazon Cloud Technology Community Introduction
13:45-14:30 Gao Changjian "Analysis of Metadata Design of Large-Scale Distributed File System"
14:30-15:15 Li Hao "What Is a Real HTAP Database?"
15:15-15:45 Coffee Break
15:45-16:15 Zhao Shengyu "Big Data in Open Source and Open Source Data Solutions"
16:15-17:00 Shen Yuhao "The Evolution of Message Queuing in the Cloud Native Era"
17:00-17:10 Interactive & Lucky Draw & Closing & Group Photo
Gao Changjian Juicedata Technical Expert
Sharing topic: Analysis of metadata design of large-scale distributed file system
Speaker bio:
A core member of the team building the JuiceFS open source community, with ten years of experience in the Internet industry. He has served as an architect at Zhihu, Jike, and Xiaohongshu, focusing on technical research in distributed systems, big data, and AI.
Talk outline:
1. What is a distributed file system
2. Introduction to the architectures of large-scale distributed file systems in the industry
3. What is the metadata of the file system
4. How to design the metadata of the file system
5. Follow-up Outlook
What you will learn:
1. Understand the concept of distributed file system
2. Understand the architecture design of large-scale distributed file systems in the industry
3. Understand how to design the metadata of a file system
Li Hao StoneDB Chief Architect
Sharing topic: What is a real HTAP database?
Speaker bio:
Chief Architect of StoneDB. He previously worked at Huawei, iQiyi, and Peking University Founder, where he designed core database kernel architecture. He has more than 10 years of experience in database kernel development and specializes in query engines, execution engines, and massively parallel processing. He holds dozens of database invention patents and is the author of "PostgreSQL Query Engine Source Code Technology Analysis".
Talk outline:
1. What is an HTAP database
2. What is the background of HTAP
3. What capabilities should a real HTAP have
4. Practical experience of HTAP
5. Thoughts on open source databases
What you will learn:
1. Understand what TP, AP, and HTAP databases are and where each is used
2. Understand what capabilities a real HTAP database should have
3. Learn from practical experience with HTAP databases
Zhao Shengyu, Director of Open Source Society, PhD student in Computer Science, Tongji University
Sharing topic: Open Source Data Solutions: Big Data in Open Source
Speaker bio:
- 2022 Open Source Society Director
- PhD student at Tongji University X-lab
- Mainly doing theoretical research and data analysis related to open source. Former member of Alibaba Open Source Office
Talk outline:
1. What is open data in open source?
2. Introduction to the open source community measurement methods used across the international industry
3. Open source data analysis and insight under graph algorithms
4. Visualization in Open Source Data
What you will learn:
1. Understand big data in the open source world
2. Understand today's mainstream data measurement methods
3. Understand the open source world from a network perspective
Shen Yuhao StreamNative Product Manager
Sharing topic: Evolution of message queues in the cloud-native era
Speaker bio:
Currently working as a product manager at StreamNative, mainly responsible for product management related to private cloud and PulsarOperator. He has worked on customer success and product management in Microsoft, Qiniu, and PingCAP, focusing on distributed systems, cloud native, and big data.
Talk outline:
1. The evolution of message queues
2. Typical characteristics of cloud-native message queues
3. Introduction to Apache Pulsar
- Layered, segment-based architecture
- Unified messaging model and protocol
- Built-in enterprise-grade features
4. Apache Pulsar application scenarios
- Financial transaction scenarios
- Cross-region multi-active scenarios
- Real-time data warehouses that unify batch and stream processing
- IoT scenarios
What you will learn:
1. Understand the features message queues have in common and the distinguishing features of mainstream message queues
2. Understand the basic concepts and architectural features of Apache Pulsar
3. Understand the application scenarios that Apache Pulsar is suitable for
Event Perks
Perk one:
Besides talks packed with substance, the offline event includes a delicious coffee break!
Perk two:
Check in at the venue to receive custom Data Everywhere series merchandise!
Perk three:
During the on-site Q&A session, you will also have a chance to win other merchandise prepared by the User Group and partner communities.