物联网 - HStreamDB Newsletter 2022-05｜Decentralized cluster mechanism, new data integration framework, new clients and deployment methods - 个人文章

This month, the HStreamDB team officially released v0.8 and started development work on v0.9, which will bring major improvements in clustering, external system integration, partitioning, and more. This month, we mainly completed the design and preliminary development of HStream IO, a new cluster mechanism and data integration framework, and started the development of a new Python client. At the same time, the Erlang client version 0.1 was officially released, and the deployment support of Helm and Alibaba Cloud was added.

HServer cluster mechanism improvement

In v0.8 and earlier versions, the HServer cluster mainly adopts the centralized clustering mechanism based on ZooKeeper. ZooKeeper is used to register and discover HServer nodes and coordinate between nodes. There is no direct communication between HServer nodes. This clustering scheme is adopted by a large number of distributed systems and is relatively mature. The main disadvantage is that it needs to rely on external systems such as ZooKeeper, which is not flexible enough, and has some limitations in terms of scalability.

In order to support larger clusters and better scalability, as well as reduce dependence on external systems, v0.9 will adopt a decentralized clustering mechanism. The new clustering scheme will be mainly based on the SWIM[1] paper, and its core includes a A set of efficient failure dectation algorithm and gossip style cluster message propagation mechanism, similar solutions have been applied in distributed systems such as Consul and Cassandra. At present, the new cluster related functions are still in the research and development process and will be officially released in v0.9.

New data integration framework HStream IO

In order to meet a variety of different business needs, there are often multiple sets of data systems or data platforms within an enterprise, including but not limited to: online transaction library, offline analysis library, cache system, search system, batch processing system, real-time processing system, data Lake and more. While focusing on streamlining and reshaping the real-time data stack, HSteamDB, as an emerging streaming database, also shoulders the mission of promoting the efficient flow of data throughout the data stack and promoting the modernization and real-timeization of enterprise data stacks. The ability to integrate with numerous external systems is also very important to HStreamDB.

HStream IO is the data integration framework within HStreamDB. It includes components such as source connectors, sink connectors, and IO Runtime. It can import data from external systems into HStreamDB through source connectors, and export data in HStreamDB to external systems through sink connectors. . It is also worth noting that HStream IO will be implemented based on Airbyte spec, which means that we will be able to fully reuse a large number of open source connectors in the Airbyte community, and quickly integrate HStreamDB with any system. This month HStream IO has completed the design and preliminary development work, and will be officially released in v0.9.

Client update

Add Python client

This month we also started the research and development of HStreamDB's Python client hstreamdb-py, which supports Python 3.7 and above, and will be officially released next month.

hstreamdb-erlang v0.1 released

This month, HStreamDB's Erlang client hstreamdb-erlang officially released v0.1. For details, please refer to https://github.com/hstreamdb/hstreamdb-erlang/blob/main/README.md

Deployment method update

Added Helm-based deployment support

Helm ( https://helm.sh/ ) can help users install and manage K8s applications more easily. This month, HStreamDB also provides Helm-based deployment support. For details, please refer to the document https://hstream.io/docs/en /latest/deployment/deploy-helm.html#building-your-kubernetes-cluster

Added Alibaba Cloud Terraform deployment support

Previously, we provided a tutorial on deploying HStreamDB on AWS and HUAWEI CLOUD based on Terraform. This month, we added support for deployment on Alibaba Cloud. For details, please refer to the document https://hstream.io/docs/zh/latest/deployment /deploy-terraform-aliyun.html

[1]: Das, A., Gupta, I. and Motivala, A., 2002, June. Swim: Scalable weakly-consistent infection-style process group membership protocol. In Proceedings International Conference on Dependable Systems and Networks (pp. 303 -312).IEEE.

Copyright statement: This article is original by EMQ, please indicate the source when reprinting.
Original link: https://hstream.io/zh/blog/hstreamdb-newsletter-202205

HStreamDB Newsletter 2022-05｜Decentralized cluster mechanism, new data integration framework, new clients and deployment methods

HServer cluster mechanism improvement

New data integration framework HStream IO

Client update

Add Python client

hstreamdb-erlang v0.1 released

Deployment method update

Added Helm-based deployment support

Added Alibaba Cloud Terraform deployment support

EMQX

引用和评论

在 Windows 平台搭建 MQTT 服务

深入探索嵌入式开发中的 FreeRTOS：从入门到精通

支付宝 IoT 设备入门宝典（下）设备经营篇

《ESP32-S3使用指南—IDF版 V1.6》第八章 MENUCONFIG菜单配置

《ESP32-S3使用指南—IDF版 V1.6》第九章 IDF组件注册表

2000 万 Tokens！告别服务器繁忙焦虑，让您免费极致体验满血 DeepSeek

破局企业AI落地难题！迅易科技DeepSeek私有化部署全场景解决方案