NVIDIA B200 GPUs available now!
Cluster Edition

Load Balancing Cluster – HaProxy

  • Cluster Deployment
  • Multi-Protocol Support
  • High Availability Mode
  • Flexible Scheduling
  • Session Persistence
  • Flexible Billing Options
Get Started

Product Features

The HaProxy Load Balancing Cluster Service provides a reliable load balancing solution within the customer's network. Based on predefined configuration rules, it automatically distributes application traffic across multiple cloud servers. This enables higher application fault tolerance and seamlessly delivers the load balancing capacity needed to manage application traffic efficiently.

By integrating with security layers, HaProxy offers enhanced and secure application access mechanisms. It can also be deployed directly on external networks to quickly enable business traffic distribution.

Product Features

Multi-Protocol Support

Supports Layer 4–7 load balancing. Traffic is distributed to backend servers based on forwarding rules configured for HTTP or TCP protocols.

Session Persistence

Offers HTTP session persistence. When enabled, user requests are consistently forwarded to the same backend server based on user session characteristics, ensuring a stable experience.

Automated Health Checks

Performs backend server health checks according to user-defined rules. Faulty servers are quickly isolated to maintain service availability.

Real-Time Operational Monitoring

Provides real-time insights into HaProxy cluster status, backend server health, queue length, session counts, and inbound/outbound bandwidth.

High Availability, Security, and Stability

Clustered deployment ensures fast failover, high availability, and stable service performance.

Application Scenarios

To improve service availability, businesses typically deploy multiple cloud servers in combination with load balancing. This approach allows for the seamless implementation of high-availability solutions without requiring changes to the application layer. Traffic is intelligently distributed to different cloud servers based on predefined load balancing policies.

With real-time monitoring, the health of backend services is continuously assessed. In the event of an abnormal instance, traffic is automatically rerouted to healthy servers, and once the issue is resolved, the instance is smoothly reintegrated into the load pool.

Scenario 1:

Scenario 2:

CONTACT

Let’s Build the Future
of AI Together

Whether you need custom AI training solutions, scalable models, or expert guidance, we’re here to help. Get in touch and let’s unlock the next stage of AI innovation—together.

Have a project? Let’s talk.