Cluster Planning

Port Planning

Doris instances communicate over the network and need the following ports to be available. Administrators can change the default ports to suit their environment:

| Instance Name | Port Name | Default Port | Communication Direction | Description |
| --- | --- | --- | --- | --- |
| BE | be_port | 9060 | FE -> BE | Thrift Server port on BE, used to receive requests from FE |
| BE | webserver_port | 8040 | BE <-> BE | HTTP Server port on BE |
| BE | heartbeat_service_port | 9050 | FE -> BE | Heartbeat service port (Thrift) on BE, used to receive heartbeats from FE |
| BE | brpc_port | 8060 | FE <-> BE, BE <-> BE | BRPC port on BE, used for communication between BEs |
| FE | http_port | 8030 | FE <-> FE, Client <-> FE | HTTP Server port on FE |
| FE | rpc_port | 9020 | BE -> FE, FE <-> FE | Thrift Server port on FE, each FE should have the same configuration |
| FE | query_port | 9030 | Client <-> FE | MySQL Server port on FE |
| FE | edit_log_port | 9010 | FE <-> FE | bdbje communication port on FE |
| Broker | broker_ipc_port | 8000 | FE -> Broker, BE -> Broker | Thrift Server port on Broker, used to receive requests from FE and BE |
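
Before starting an instance, it can help to confirm that its default ports are not already taken on the host. The following is a minimal sketch of such a pre-flight check, not a Doris tool: the port numbers come from the table above, while `DEFAULT_PORTS`, `port_is_free`, and `occupied_ports` are hypothetical names chosen for illustration. It only tests whether a TCP socket can be bound locally, so it says nothing about firewalls or reachability from other nodes.

```python
import socket

# Default ports from the table above, keyed by (instance, config name).
DEFAULT_PORTS = {
    ("BE", "be_port"): 9060,
    ("BE", "webserver_port"): 8040,
    ("BE", "heartbeat_service_port"): 9050,
    ("BE", "brpc_port"): 8060,
    ("FE", "http_port"): 8030,
    ("FE", "rpc_port"): 9020,
    ("FE", "query_port"): 9030,
    ("FE", "edit_log_port"): 9010,
    ("Broker", "broker_ipc_port"): 8000,
}


def port_is_free(port, host="0.0.0.0"):
    """Return True if a TCP socket can be bound to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use" or "permission denied".
            return False
        return True


def occupied_ports(instance, host="0.0.0.0"):
    """List (config name, port) pairs of one instance type that cannot be bound."""
    return [
        (name, port)
        for (inst, name), port in sorted(DEFAULT_PORTS.items())
        if inst == instance and not port_is_free(port, host)
    ]


if __name__ == "__main__":
    for instance in ("FE", "BE", "Broker"):
        print(instance, occupied_ports(instance))
```

If a port is reported as occupied, either free it or change the corresponding configuration item for that instance.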

Node Count Planning

FE Node Count

FE nodes are primarily responsible for user request handling, query parsing and planning, metadata management, and node management.

For production clusters, it is generally recommended to deploy at least 3 FE nodes for high availability. Each FE node takes one of the following two roles:

  • Follower nodes: Participate in Master election. When the Master node fails, one of the Follower nodes is elected as the new Master.

  • Observer nodes: Only sync metadata from the Master node and do not participate in elections. Adding Observer nodes scales out the read capacity of the metadata service.

In general, it is recommended to deploy at least 3 Follower nodes. In high-concurrency scenarios, increasing the number of Observer nodes can help improve the cluster's connection capacity.
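
The reasoning behind the 3-Follower recommendation can be sketched with a little arithmetic. The example below assumes that electing a Master and writing metadata require a strict majority of Follower nodes to be alive, which is the usual rule for majority-based replication; the function names are hypothetical and Observer nodes are deliberately left out because they do not vote.

```python
def follower_quorum(follower_count):
    """Smallest number of Followers that forms a strict majority (assumed rule)."""
    if follower_count < 1:
        raise ValueError("at least one Follower is required")
    return follower_count // 2 + 1


def tolerated_follower_failures(follower_count):
    """How many Followers can fail while a majority is still alive."""
    return follower_count - follower_quorum(follower_count)


for n in (1, 2, 3, 5):
    print(
        f"{n} Follower(s): quorum={follower_quorum(n)}, "
        f"tolerated failures={tolerated_follower_failures(n)}"
    )
```

Under this assumption, 1 or 2 Followers tolerate no failure, 3 Followers tolerate one, and 5 tolerate two, which is why an odd count of at least 3 is the common choice.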

BE Node Count

BE nodes are responsible for data storage and computation. In production environments, data is usually stored with 3 replicas to ensure reliability and fault tolerance. Therefore, it is recommended to deploy at least 3 BE nodes.
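
The link between replica count and BE node count can be illustrated as follows. This sketch assumes that each replica of a piece of data is placed on a different BE node, so a cluster needs at least as many BE nodes as replicas; `can_place_replicas` and `surviving_replicas` are hypothetical helpers, not Doris APIs.

```python
def can_place_replicas(be_count, replication_num=3):
    """True if every replica can land on a distinct BE node (assumed placement rule)."""
    if be_count < 0 or replication_num < 1:
        raise ValueError("be_count must be >= 0 and replication_num >= 1")
    return be_count >= replication_num


def surviving_replicas(replication_num, failed_bes):
    """Worst case: every failed BE held a replica of the same data."""
    return max(replication_num - failed_bes, 0)


for be_count in (1, 2, 3, 4):
    print(f"{be_count} BE node(s): 3 replicas placeable = {can_place_replicas(be_count)}")

for failed in (0, 1, 2, 3):
    print(f"{failed} failed BE(s): at least {surviving_replicas(3, failed)} replica(s) left")
```

With 3 replicas on 3 or more BE nodes, losing one or two BE nodes still leaves at least one copy of the data in the worst case.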

BE nodes scale horizontally: adding BE nodes improves the cluster's query performance and concurrent processing capacity.