头图

2022/8/13 Data Everywhere Series Events-Hangzhou Station

Open source meets big data

Data is ubiquitous, and under the wave of big data, wave after wave of open source projects born for data have begun to rise. We can not only see the acceleration effect of open source on data application, but also try to use data to analyze open source, open source and What kind of sparks will emerge from the encounter of big data? In this issue of sharing, we will hold four different lectures around big data and open source projects, from distributed file systems to HTAP databases, to insight analysis of open source data and the evolution of message queues. Enterprises and individuals related to open source big data bring enough benefits.

Event Information and Registration

Event time: 13:00-17:10 pm on August 13

Venue: Herun House, 9th Floor, Building 2, No. 59, Xiba Road, Yuhang District, Hangzhou (diagonally opposite to Shuzhi Engine)

Registration method: https://mini.awsapp.cn/l/ptjvfP5O0FJj

Reminder: According to Hangzhou's epidemic prevention policy, a negative nucleic acid test certificate within seven days is required, and the latest epidemic prevention policy shall prevail.

schedule

13:00-13:30 Check-in

13:30-13:40 Opening Introduction and Amazon Cloud Technology Community Introduction

13:45-14:30 Gao Changjian "Analysis of Metadata Design of Large-Scale Distributed File System"

14:30-15:15 Li Hao "What is the real HTAP database?"

15:15-15:45 Coffee Break

15:45-16:15 Zhao Shengyu, "Big Data in Open Source and Open Source Data Solutions"

16:15-17:00 Shen Yuhao "The Evolution of Message Queuing in the Cloud Native Era"

17:00-17:10 Interactive & Lucky Draw & Closing & Group Photo

Gao Changjian Juicedata Technical Expert

Sharing topic: Analysis of metadata design of large-scale distributed file system

Instructors:
Participated in the construction of the main team of the JuiceFS open source community. Ten years of experience in the Internet industry. He has served as an architect in Zhihu, Immediate, and Xiaohongshu, focusing on technical research in the fields of distributed systems, big data, and Al.

Share content:
1. What is a distributed file system
2 Introduction to the architecture of the industry's large-scale distributed file system
3. What is the metadata of the file system
4. How to design the metadata of the file system
5. Follow-up Outlook

Listener Benefit:
1. Understand the concept of distributed file system
2. Understand the architecture design of large-scale distributed file systems in the industry
3. Understand how to design the metadata of a file system

Li Hao StoneDB Chief Architect

Share topic: What is a real HTAP database?

Instructors:
Chief Architect of StoneDB, worked in Huawei, iQiyi, and Founder of Peking University to design the core architecture of the database kernel. More than 10 years of experience in database kernel development, good at query engine, execution engine, large-scale parallel processing and other technologies. Possess dozens of database invention patents, and author of "PostgreSOL Query Engine Source Code Technology Analysis"

Share content:
1. What is an HTAP database
2. What is the background of HTAP
3. What capabilities should a real HTAP have
4. Practical experience of HTAP
5. Thoughts on open source databases

Listener Benefit:
1. Understand what TP database, AP database, and HTAP database are, and their usage scenarios
2. Understand what capabilities a real HTAP database should have
3. Learn from experience in HTAP database practice

Zhao Shengyu, Director of Open Source Society, PhD student in Computer Science, Tongji University

Sharing topic: Open Source Data Solutions_ Big Data in Open Source

Instructors:

  • 2022 Open Source Society Director
  • PhD student at Tongji University X-lab
  • Mainly doing theoretical research and data analysis related to open source. Former member of Alibaba Open Source Office

Speech content:
1. What is open data in open source?
2. Introduction to open source community measurement methods in international industries
3. Open source data analysis and insight under graph algorithms
4. Visualization in Open Source Data

Listener benefits:
1. Understand big data in an open source world
2. Understand the current mainstream data measurement methods
3. Understand the open source world from a network perspective

Shen Yuhao StreamNative Product Manager

Sharing topic: Evolution of message queues in the cloud-native era

Instructors:
Currently working as a product manager at StreamNative, mainly responsible for product management related to private cloud and PulsarOperator. He has worked on customer success and product management in Microsoft, Qiniu, and PingCAP, focusing on distributed systems, cloud native, and big data.

Speech content:
1. The evolution of message queues
2. Typical characteristics of cloud-native message queues
3. Introduction to Apache Pulsar

  • Hierarchical Sharding Architecture
  • Unified messaging model and protocol - built-in enterprise-grade features

4. Apache Pulsar application scenarios

  • financial transaction scene
  • Multi-active scenarios across regions
  • Batch stream fusion real-time data warehouse
  • IoT Scenario

Listener Benefit:
1. Understand some common features of message queues and some features of mainstream message queues
2. Understand the basic concepts and architectural features of Apache Pulsar
3. Understand the application scenarios that Apache Pulsar is suitable for

Activity Welfare

Benefit one:
In offline activities, in addition to sharing content full of dry goods, there must be a delicate and delicious coffee break!

Benefit two:
Sign in at the event site to receive the exquisite custom peripherals from the Data Everywhere series!

image.png

Benefit three:
During the on-site question and answer session, you will also have the opportunity to obtain other exquisite peripherals carefully prepared for you by User Group and the cooperative community~


亚马逊云开发者
2.9k 声望9.6k 粉丝

亚马逊云开发者社区是面向开发者交流与互动的平台。在这里,你可以分享和获取有关云计算、人工智能、IoT、区块链等相关技术和前沿知识,也可以与同行或爱好者们交流探讨,共同成长。