Network Protocol: Proxy Protocol of haproxy

Introduction

The proxy should be familiar to everyone, the more famous ones are nginx, apache HTTPD, stunnel, etc.

We know that the proxy is to replace the client to make a message request to the server, and we hope to retain the initial TCP connection information, such as source and destination IP and port, in the process of proxy, to provide some personalized operations.

In general, in order to achieve this goal, there are some ready-made solutions, such as in the HTTP protocol, you can use the "X-Forwarded-For" header to include information about the original source address, and "X-Original-To" Information used to carry the destination address.

For another example, in the SMTP protocol, the XCLIENT protocol can be specially used for mail exchange.

Or you can compile the kernel with your proxy as the default gateway for your server.

Although these methods are available, they have more or less restrictions, either related to the protocol or modifying the system architecture, so the scalability is not strong.

Especially in the case of multiple proxy servers chained calls, the above method is almost impossible to complete.

This requires a unified proxy protocol, through which all nodes are compatible with this proxy protocol, and the chain call of the proxy can be seamlessly implemented. This proxy protocol is the proxy Protocol proposed by haproxy in 2010.

The advantages of this proxy protocol are:

It is protocol agnostic (can be used with any layer 7 protocol, even with encryption)
It does not require any infrastructure changes
Can penetrate NAT firewall
it is extensible

And haproxy itself is a very good open source load balancing and proxy software, providing high load capacity and excellent performance, so it is widely used in many companies, such as: GoDaddy, GitHub, Bitbucket, Stack Overflow, Reddit, Slack, Speedtest .net, Tumblr, Twitter, etc.

What I want to introduce today is the underlying details of haproxy's Proxy Protocol.

Implementation details of the Proxy Protocol

We mentioned above that the purpose of Proxy Protocol is to carry some fields that can mark the initial TCP connection information, such as IP address and port.

If the client and server are directly connected, the server can obtain the following information through getsockname and getpeername:

address family: AF_INET for IPv4, AF_INET6 for IPv6, AF_UNIX
socket protocol: SOCK_STREAM for TCP, SOCK_DGRAM for UDP
Source and destination addresses at the network layer
The source and destination port numbers of the transport layer

Therefore, the purpose of Proxy Protocol is to encapsulate the above information, and then put the above information into the request header, so that the server can correctly read the client's information.

In the Proxy Protocol, two versions are defined.

In version 1, the header file information is in text form, that is, human-readable. This method is mainly used to ensure better debuggability in the early stage of protocol application, so as to quickly correct the scene.

In version 2, the binary encoding function of the header file is provided. On the premise that the functions of version 1 have been basically perfected, binary encoding is provided, which can effectively improve the transmission and processing performance of the application.

Because there are two versions, the receiving end of the server also needs to implement support for the corresponding version.

In order to better apply the Proxy Protocol, the Proxy Protocol actually defines only one header information. This request header will be placed at the beginning of each connection when the connection initiator initiates the connection. And the protocol is stateless because it doesn't expect the sender to wait for the receiver before sending headers, nor does it expect the receiver to send anything back.

Next, we specifically observe the implementation of the two versions of the protocol.

version 1

In version 1, the proxy header consisted of a string of US-ASCII encoded strings. This proxy header will be sent before the connection is established between the client and server, and before any real data is sent.

Let's first look at an example of an http request using a proxy header:

 PROXY TCP4 192.168.0.1 192.168.0.102 12345 443\r\n
    GET / HTTP/1.1\r\n
    Host: 192.168.0.102\r\n
    \r\n

In the above example, \r\n means carriage return and line feed, which is the end-of-line marker. The code sends an HTTP request to host:192.168.0.102, and the first line is the proxy header used.

What exactly does it mean?

The first is the string "PROXY", indicating that this is a proxy protocol header, and is the v1 version.