Simplifying MySQL Sharding with ProxySQL: A Step-by-Step Guide

Vivek Thakkar

Cloud & DevOps

Tags:

MySQL

ProxySQL

devops

Database Sharding

Introduction:

ProxySQL is a powerful SQL-aware proxy designed to sit between database servers and client applications, optimizing database traffic with features like load balancing, query routing, and failover. This article focuses on simplifying the setup of ProxySQL, especially for users implementing data-based sharding in a MySQL database.

What is Sharding?

Sharding involves partitioning a database into smaller, more manageable pieces called shards based on certain criteria, such as data attributes. ProxySQL supports data-based sharding, allowing users to distribute data across different shards based on specific conditions.

Understanding the Need for ProxySQL:

ProxySQL is an intermediary layer that enhances database management, monitoring, and optimization. With features like data-based sharding, ProxySQL is an ideal solution for scenarios where databases need to be distributed based on specific data attributes, such as geographic regions.

‍Installation & Setup:‍

There are two ways to install the proxy, either by installing it using packages or running ProxySQL in docker. ProxySQL can be installed using two methods: via packages or running it in a Docker container. For this guide, we will focus on the Docker installation.

1. Install ProxySQL and MySQL Docker Images:

To start, pull the necessary Docker images for ProxySQL and MySQL using the following commands:

	docker pull mysql:latest
	docker pull proxysql/proxysql

view raw .sh hosted with ❤ by GitHub

2. Create Docker Network:

Create a Docker network for communication between MySQL containers:

docker network create multi-tenant-network

view raw .sh hosted with ❤ by GitHub

Note: ProxySQL setup will need connections to multiple SQL servers. So, we will set up multiple SQL servers on our docker inside a Docker network.

Containers within the same Docker network can communicate with each other using their container names or IP addresses.

You can check the list of all the Docker networks currently present by running the following command:

docker network ls

view raw .sh hosted with ❤ by GitHub

3. Set Up MySQL Containers:

Now, create three MySQL containers within the network:

Note: We can create any number of MySQL containers.

	docker run -d --name mysql_host_1 --network=multi-tenant-network -p 3307:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest
	docker run -d --name mysql_host_2 --network=multi-tenant-network -p 3308:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest
	docker run -d --name mysql_host_3 --network=multi-tenant-network -p 3309:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest

view raw .sh hosted with ❤ by GitHub

Note: Adjust port numbers as necessary.

The default MySQL protocol port is 3306, but since we cannot access all three of our MySQL containers on the same port, we have set their ports to 3307, 3308, and 3309. Although internally, all MySQL containers will connect using port 3306.

--network=multi-tenant-network. This specifies that the container should be created under the specified network.

We have also specified the root password of the MySQL container to log into it, where the username is “root” and the password is “pass123” for all three of them.

After running the above three commands, three MySQL containers will start running inside the network. You can connect to these three hosts using host = localhost or 127.0.0.1 and port = 3307 / 3308 / 3309.

To ping the port, use the following command:

for macOS:

nc -zv 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

for Windows:

ping 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

for Linux:

telnet 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

‍

4. Create Users in MySQL Containers:

Create “user_shard” and “monitor” users in each MySQL container.

The “user_shard” user will be used by the proxy to make queries to the DB.

The “monitor” user will be used by the proxy to monitor the DB.

Note: To access the MySQL container mysql_host_1, use the command:

docker exec -it mysql_host_1 mysql -uroot -ppass123

view raw .sh hosted with ❤ by GitHub

Use the following commands inside the MySQL container to create the user:‍‍

	CREATE USER 'user_shard'@'%' IDENTIFIED BY 'pass123';
	GRANT ALL PRIVILEGES ON . TO 'user_shard'@'%' WITH GRANT OPTION;
	FLUSH PRIVILEGES;

	CREATE USER monitor@'%' IDENTIFIED BY 'pass123';
	GRANT ALL PRIVILEGES ON . TO monitor@'%' WITH GRANT OPTION;
	FLUSH PRIVILEGES;

view raw .sql hosted with ❤ by GitHub

Repeat the above steps for mysql_host_2 & mysql_host_3.‍

If, at any point, you need to drop the user, you can use the following command:

DROP USER monitor@’%’;

view raw .sql hosted with ❤ by GitHub

5. Prepare ProxySQL Configuration:

To prepare the configuration, we will need the IP addresses of the MySQL containers. To find those, we can use the following command:

	docker inspect mysql_host_1;
	docker inspect mysql_host_2;
	docker inspect mysql_host_3;

view raw .sh hosted with ❤ by GitHub

By running these commands, you will get all the details of the MySQL Docker container under a field named “IPAddress” inside your network. That is the IP address of that particular MySQL container.

Example:
mysql_host_1: 172.19.0.2‍

mysql_host_2: 172.19.0.3‍

mysql_host_3: 172.19.0.4

Reference image for IP address of *mysql_host_1*: 172.19.0.2

Now, create a ProxySQL configuration file named proxysql.cnf. Include details such as IP addresses of MySQL containers, administrative credentials, and MySQL users.

Below is the content that needs to be added to the proxysql.cnf file:

	datadir="/var/lib/proxysql"

	admin_variables=
	{
	admin_credentials="admin:admin;radmin:radmin"
	mysql_ifaces="0.0.0.0:6032"
	refresh_interval=2000
	hash_passwords=false
	}

	mysql_variables=
	{
	threads=4
	max_connections=2048
	default_query_delay=0
	default_query_timeout=36000000
	have_compress=true
	poll_timeout=2000
	interfaces="0.0.0.0:6033;/tmp/proxysql.sock"
	default_schema="information_schema"
	stacksize=1048576
	server_version="5.1.30"
	connect_timeout_server=10000
	monitor_history=60000
	monitor_connect_interval=200000
	monitor_ping_interval=200000
	ping_interval_server_msec=10000
	ping_timeout_server=200
	commands_stats=true
	sessions_sort=true
	monitor_username="monitor"
	monitor_password="pass123"
	}

	mysql_servers =
	(
	{ address="172.19.0.2" , port=3306 , hostgroup=10, max_connections=100 },
	{ address="172.19.0.3" , port=3306 , hostgroup=20, max_connections=100 },
	{ address="172.19.0.4" , port=3306 , hostgroup=30, max_connections=100 }
	)


	mysql_users =
	(
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 10 , active = 1 },
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 20 , active = 1 },
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 30 , active = 1 }
	)

view raw .sh hosted with ❤ by GitHub

Most of the settings are default; we won’t go into much detail for each setting.

admin_variables: These variables are used for ProxySQL's administrative interface. It allows you to connect to ProxySQL and perform administrative tasks such as configuring runtime settings, managing servers, and monitoring performance.

mysql_variables, monitor_username, and monitor_password are used to specify the username that ProxySQL will use when connecting to MySQL servers for monitoring purposes. This monitoring user is used to execute queries and gather statistics about the health and performance of the MySQL servers. This is the user we created during step 4.

mysql_servers will contain all the MySQL servers we want to be connected with ProxySQL. Each entry will have the IP address of the MySQL container, port, host group, and max_connections. Mysql_users will have all the users we created during step 4.

7. Run ProxySQL Container:

Inside the same directory where the proxysql.cnf file is located, run the following command to start ProxySQL:

docker run -d --rm -p 6032:6032 -p 6033:6033 -p 6080:6080 --name=proxysql --network=multi-tenant-network -v $PWD/proxysql.cnf:/etc/proxysql.cnf proxysql/proxysql

view raw .sh hosted with ❤ by GitHub

Here, port 6032 is used for ProxySQL's administrative interface. It allows you to connect to ProxySQL and perform administrative tasks such as configuring runtime settings, managing servers, and monitoring performance.

Port 6033 is the default port for ProxySQL's MySQL protocol interface. It is used for handling MySQL client connections. Our application will use it to access the ProxySQL db and make SQL queries.

The above command will make ProxySQL run on our Docker with the configuration provided in the proxysql.cnf file.

Inside ProxySQL Container:

8. Access ProxySQL Admin Console:

Now, to access the ProxySQL Docker container, use the following command:

docker exec -it proxysql bash

view raw .sh hosted with ❤ by GitHub

Now, once you’re inside the ProxySQL Docker container, you can access the ProxySQL admin console using the command:

mysql -u admin -padmin -h 127.0.0.1 -P 6032

view raw .sh hosted with ❤ by GitHub

You can run the following queries to get insights into your ProxySQL server:

i) To get the list of all the connected MySQL servers:

SELECT * FROM mysql_servers;

view raw .sql hosted with ❤ by GitHub

ii) Verify the status of the MySQL backends in the monitor database tables in ProxySQL admin using the following command:

SHOW TABLES FROM monitor;

view raw .sql hosted with ❤ by GitHub

If this returns an empty set, it means that the monitor username and password are not set correctly. You can do so by using the below commands:

	UPDATE global_variables SET variable_value=’monitor’ WHERE variable_name='mysql-monitor_username';
	UPDATE global_variables SET variable_value=’pass123’ WHERE variable_name='mysql-monitor_password';
	LOAD MYSQL VARIABLES TO RUNTIME;
	SAVE MYSQL VARIABLES TO DISK;

view raw .sql hosted with ❤ by GitHub

And then restart the proxy Docker container:

iii) Check the status of DBs connected to ProxySQL using the following command:

SELECT * FROM monitor.mysql_server_connect_log ORDER BY time_start_us DESC;

view raw .sql hosted with ❤ by GitHub

iv) To get a list of all the ProxySQL global variables, use the following command:

SELECT * FROM global_variables;

view raw .sql hosted with ❤ by GitHub

v) To get all the queries made on ProxySQL, use the following command:

Select * from stats_mysql_query_digest;

view raw .sql hosted with ❤ by GitHub

Note: Whenever we change any row, use the below commands to load them:

Change in variables:

	LOAD MYSQL VARIABLES TO RUNTIME;
	SAVE MYSQL VARIABLES TO DISK;

	Change in mysql_servers:
	LOAD MYSQL SERVERS TO RUNTIME;
	SAVE MYSQL SERVERS TO DISK;

	Change in mysql_query_rules:
	LOAD MYSQL QUERY RULES TO RUNTIME;
	SAVE MYSQL QUERY RULES TO DISK;

view raw .sql hosted with ❤ by GitHub

And then restart the proxy docker container.

IMPORTANT:

To connect to ProxySQL’s admin console, first get into the Docker container using the following command:

docker exec -it proxysql bash

view raw .sh hosted with ❤ by GitHub

Then, to access the ProxySQL admin console, use the following command:

mysql -u admin -padmin -h 127.0.0.1 -P6032

view raw .sh hosted with ❤ by GitHub

To access the ProxySQL MySQL console, we can directly access it using the following command without going inside the Docker ProxySQL container:

mysql -u user_shard -ppass123 -h 127.0.0.1 -P6033

view raw .sh hosted with ❤ by GitHub

To make queries to the database, we make use of ProxySQL’s 6033 port, where MySQL is being accessed.

9. Define Query Rules:

We can add custom query rules inside the mysql_query_rules table to redirect queries to specific databases based on defined patterns. Load the rules to runtime and save to disk.

12. Sharding Example:

Now, let's illustrate how to leverage ProxySQL's data-based sharding capabilities through a practical example. We'll create three MySQL containers, each containing data from different continents in the “world” database, specifically within the “countries” table.

Step 1: Create 3 MySQL containers named mysql_host_1, mysql_host_2 & mysql_host_3.

Inside all containers, create a database named “world” with a table named “countries”.

i) Inside mysql_host_1: Insert countries using the following query:

INSERT INTO `countries` VALUES (1,'India','Asia'),(2,'Japan','Asia'),(3,'China','Asia'),(4,'USA','North America'),(5,'Cuba','North America'),(6,'Honduras','North America');

view raw .sql hosted with ❤ by GitHub

ii) Inside mysql_host_2: Insert countries using the following query:

INSERT INTO `countries` VALUES (1,'Kenya','Africa'),(2,'Ghana','Africa'),(3,'Morocco','Africa'),(4, "Brazil", "South America"), (5, "Chile", "South America"), (6, "Morocco", "South America");

view raw .sql hosted with ❤ by GitHub

iii) Inside mysql_host_3: Insert countries using the following query:

Now, we have distinct data sets for Asia & North America in mysql_host_1, Africa & South America in mysql_host_2, and Europe in mysql_host_3.

Step 2: Define Query Rules for Sharding

Let's create custom query rules to redirect queries based on the continent specified in the SQL statement.

For example, if the query contains the continent “Asia,” we want it to be directed to mysql_host_1.

-- Query Rule for Asia and North America

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (10, 1, 'user_shard', "\s*continent\s*=\s*.*?(Asia|North America).*?\s*", 10, 0);

view raw .sql hosted with ❤ by GitHub

-- Query Rule for Africa and South America

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (20, 1, 'user_shard', "\s*continent\s*=\s*.*?(Africa|South America).*?\s*", 20, 0);

view raw .sql hosted with ❤ by GitHub

-- Query Rule for Europe ‍

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (30, 1, 'user_shard', "\s*continent\s*=\s*.*?(Europe).*?\s*", 30, 0);

view raw .sql hosted with ❤ by GitHub

Step 3: Apply and Save Query Rules

After adding the query rules, ensure they take effect by running the following commands:‍

	LOAD MYSQL QUERY RULES TO RUNTIME;
	SAVE MYSQL QUERY RULES TO DISK;

view raw .sql hosted with ❤ by GitHub

Now, access the MySQL server using the ProxySQL port and execute queries:

mysql -u user_shard -ppass123 -h 127.0.0.1 -P 6033

view raw .sh hosted with ❤ by GitHub

use world;

view raw .sql hosted with ❤ by GitHub

-- Example Queries:

Select * from countries where id = 1 and continent = "Asia";

view raw .sql hosted with ❤ by GitHub

-- This will return id=1, name=India, continent=Asia

Select * from countries where id = 1 and continent = "Africa";

view raw .sql hosted with ❤ by GitHub

-- This will return id=1, name=Kenya, continent=Africa.

Select * from countries where id = 1 and continent = "Africa";

view raw .sql hosted with ❤ by GitHub

Based on the defined query rules, the queries will be redirected to the specified MySQL host groups. If no rules match, the default host group that’s specified in mysql_users inside proxysql.cnf will be used.

Conclusion:

ProxySQL simplifies access to distributed data through effective sharding strategies. Its flexible query rules, combined with regex patterns and host group definitions, offer significant flexibility with relative simplicity.

By following this step-by-step guide, users can quickly set up ProxySQL and leverage its capabilities to optimize database performance and achieve efficient data distribution.

References:

Download and Install ProxySQL - ProxySQL

How to configure ProxySQL for the first time - ProxySQL

Admin Variables - ProxySQL‍

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Simplifying MySQL Sharding with ProxySQL: A Step-by-Step Guide

Introduction:

What is Sharding?

Understanding the Need for ProxySQL:

‍Installation & Setup:‍

1. Install ProxySQL and MySQL Docker Images:

To start, pull the necessary Docker images for ProxySQL and MySQL using the following commands:

	docker pull mysql:latest
	docker pull proxysql/proxysql

view raw .sh hosted with ❤ by GitHub

2. Create Docker Network:

Create a Docker network for communication between MySQL containers:

docker network create multi-tenant-network

view raw .sh hosted with ❤ by GitHub

Note: ProxySQL setup will need connections to multiple SQL servers. So, we will set up multiple SQL servers on our docker inside a Docker network.

Containers within the same Docker network can communicate with each other using their container names or IP addresses.

You can check the list of all the Docker networks currently present by running the following command:

docker network ls

view raw .sh hosted with ❤ by GitHub

3. Set Up MySQL Containers:

Now, create three MySQL containers within the network:

Note: We can create any number of MySQL containers.

	docker run -d --name mysql_host_1 --network=multi-tenant-network -p 3307:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest
	docker run -d --name mysql_host_2 --network=multi-tenant-network -p 3308:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest
	docker run -d --name mysql_host_3 --network=multi-tenant-network -p 3309:3306 -e MYSQL_ROOT_PASSWORD=pass123 mysql:latest

view raw .sh hosted with ❤ by GitHub

Note: Adjust port numbers as necessary.

--network=multi-tenant-network. This specifies that the container should be created under the specified network.

We have also specified the root password of the MySQL container to log into it, where the username is “root” and the password is “pass123” for all three of them.

To ping the port, use the following command:

for macOS:

nc -zv 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

for Windows:

ping 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

for Linux:

telnet 127.0.0.1 3307

view raw .sh hosted with ❤ by GitHub

‍

4. Create Users in MySQL Containers:

Create “user_shard” and “monitor” users in each MySQL container.

The “user_shard” user will be used by the proxy to make queries to the DB.

The “monitor” user will be used by the proxy to monitor the DB.

Note: To access the MySQL container mysql_host_1, use the command:

docker exec -it mysql_host_1 mysql -uroot -ppass123

view raw .sh hosted with ❤ by GitHub

Use the following commands inside the MySQL container to create the user:‍‍

	CREATE USER 'user_shard'@'%' IDENTIFIED BY 'pass123';
	GRANT ALL PRIVILEGES ON . TO 'user_shard'@'%' WITH GRANT OPTION;
	FLUSH PRIVILEGES;

	CREATE USER monitor@'%' IDENTIFIED BY 'pass123';
	GRANT ALL PRIVILEGES ON . TO monitor@'%' WITH GRANT OPTION;
	FLUSH PRIVILEGES;

view raw .sql hosted with ❤ by GitHub

Repeat the above steps for mysql_host_2 & mysql_host_3.‍

If, at any point, you need to drop the user, you can use the following command:

DROP USER monitor@’%’;

view raw .sql hosted with ❤ by GitHub

5. Prepare ProxySQL Configuration:

To prepare the configuration, we will need the IP addresses of the MySQL containers. To find those, we can use the following command:

	docker inspect mysql_host_1;
	docker inspect mysql_host_2;
	docker inspect mysql_host_3;

view raw .sh hosted with ❤ by GitHub

Example:
mysql_host_1: 172.19.0.2‍

mysql_host_2: 172.19.0.3‍

mysql_host_3: 172.19.0.4

Now, create a ProxySQL configuration file named proxysql.cnf. Include details such as IP addresses of MySQL containers, administrative credentials, and MySQL users.

Below is the content that needs to be added to the proxysql.cnf file:

	datadir="/var/lib/proxysql"

	admin_variables=
	{
	admin_credentials="admin:admin;radmin:radmin"
	mysql_ifaces="0.0.0.0:6032"
	refresh_interval=2000
	hash_passwords=false
	}

	mysql_variables=
	{
	threads=4
	max_connections=2048
	default_query_delay=0
	default_query_timeout=36000000
	have_compress=true
	poll_timeout=2000
	interfaces="0.0.0.0:6033;/tmp/proxysql.sock"
	default_schema="information_schema"
	stacksize=1048576
	server_version="5.1.30"
	connect_timeout_server=10000
	monitor_history=60000
	monitor_connect_interval=200000
	monitor_ping_interval=200000
	ping_interval_server_msec=10000
	ping_timeout_server=200
	commands_stats=true
	sessions_sort=true
	monitor_username="monitor"
	monitor_password="pass123"
	}

	mysql_servers =
	(
	{ address="172.19.0.2" , port=3306 , hostgroup=10, max_connections=100 },
	{ address="172.19.0.3" , port=3306 , hostgroup=20, max_connections=100 },
	{ address="172.19.0.4" , port=3306 , hostgroup=30, max_connections=100 }
	)


	mysql_users =
	(
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 10 , active = 1 },
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 20 , active = 1 },
	{ username = "user_shard" , password = "pass123" , default_hostgroup = 30 , active = 1 }
	)

view raw .sh hosted with ❤ by GitHub

Most of the settings are default; we won’t go into much detail for each setting.

7. Run ProxySQL Container:

Inside the same directory where the proxysql.cnf file is located, run the following command to start ProxySQL:

docker run -d --rm -p 6032:6032 -p 6033:6033 -p 6080:6080 --name=proxysql --network=multi-tenant-network -v $PWD/proxysql.cnf:/etc/proxysql.cnf proxysql/proxysql

view raw .sh hosted with ❤ by GitHub

The above command will make ProxySQL run on our Docker with the configuration provided in the proxysql.cnf file.

Inside ProxySQL Container:

8. Access ProxySQL Admin Console:

Now, to access the ProxySQL Docker container, use the following command:

docker exec -it proxysql bash

view raw .sh hosted with ❤ by GitHub

Now, once you’re inside the ProxySQL Docker container, you can access the ProxySQL admin console using the command:

mysql -u admin -padmin -h 127.0.0.1 -P 6032

view raw .sh hosted with ❤ by GitHub

You can run the following queries to get insights into your ProxySQL server:

i) To get the list of all the connected MySQL servers:

SELECT * FROM mysql_servers;

view raw .sql hosted with ❤ by GitHub

ii) Verify the status of the MySQL backends in the monitor database tables in ProxySQL admin using the following command:

SHOW TABLES FROM monitor;

view raw .sql hosted with ❤ by GitHub

If this returns an empty set, it means that the monitor username and password are not set correctly. You can do so by using the below commands:

	UPDATE global_variables SET variable_value=’monitor’ WHERE variable_name='mysql-monitor_username';
	UPDATE global_variables SET variable_value=’pass123’ WHERE variable_name='mysql-monitor_password';
	LOAD MYSQL VARIABLES TO RUNTIME;
	SAVE MYSQL VARIABLES TO DISK;

view raw .sql hosted with ❤ by GitHub

And then restart the proxy Docker container:

iii) Check the status of DBs connected to ProxySQL using the following command:

SELECT * FROM monitor.mysql_server_connect_log ORDER BY time_start_us DESC;

view raw .sql hosted with ❤ by GitHub

iv) To get a list of all the ProxySQL global variables, use the following command:

SELECT * FROM global_variables;

view raw .sql hosted with ❤ by GitHub

v) To get all the queries made on ProxySQL, use the following command:

Select * from stats_mysql_query_digest;

view raw .sql hosted with ❤ by GitHub

Note: Whenever we change any row, use the below commands to load them:

Change in variables:

	LOAD MYSQL VARIABLES TO RUNTIME;
	SAVE MYSQL VARIABLES TO DISK;

	Change in mysql_servers:
	LOAD MYSQL SERVERS TO RUNTIME;
	SAVE MYSQL SERVERS TO DISK;

	Change in mysql_query_rules:
	LOAD MYSQL QUERY RULES TO RUNTIME;
	SAVE MYSQL QUERY RULES TO DISK;

view raw .sql hosted with ❤ by GitHub

And then restart the proxy docker container.

IMPORTANT:

To connect to ProxySQL’s admin console, first get into the Docker container using the following command:

docker exec -it proxysql bash

view raw .sh hosted with ❤ by GitHub

Then, to access the ProxySQL admin console, use the following command:

mysql -u admin -padmin -h 127.0.0.1 -P6032

view raw .sh hosted with ❤ by GitHub

To access the ProxySQL MySQL console, we can directly access it using the following command without going inside the Docker ProxySQL container:

mysql -u user_shard -ppass123 -h 127.0.0.1 -P6033

view raw .sh hosted with ❤ by GitHub

To make queries to the database, we make use of ProxySQL’s 6033 port, where MySQL is being accessed.

9. Define Query Rules:

We can add custom query rules inside the mysql_query_rules table to redirect queries to specific databases based on defined patterns. Load the rules to runtime and save to disk.

Inside all containers, create a database named “world” with a table named “countries”.

i) Inside mysql_host_1: Insert countries using the following query:

INSERT INTO `countries` VALUES (1,'India','Asia'),(2,'Japan','Asia'),(3,'China','Asia'),(4,'USA','North America'),(5,'Cuba','North America'),(6,'Honduras','North America');

view raw .sql hosted with ❤ by GitHub

ii) Inside mysql_host_2: Insert countries using the following query:

INSERT INTO `countries` VALUES (1,'Kenya','Africa'),(2,'Ghana','Africa'),(3,'Morocco','Africa'),(4, "Brazil", "South America"), (5, "Chile", "South America"), (6, "Morocco", "South America");

view raw .sql hosted with ❤ by GitHub

iii) Inside mysql_host_3: Insert countries using the following query:

Now, we have distinct data sets for Asia & North America in mysql_host_1, Africa & South America in mysql_host_2, and Europe in mysql_host_3.

Step 2: Define Query Rules for Sharding

Let's create custom query rules to redirect queries based on the continent specified in the SQL statement.

For example, if the query contains the continent “Asia,” we want it to be directed to mysql_host_1.

-- Query Rule for Asia and North America

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (10, 1, 'user_shard', "\s*continent\s*=\s*.*?(Asia|North America).*?\s*", 10, 0);

view raw .sql hosted with ❤ by GitHub

-- Query Rule for Africa and South America

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (20, 1, 'user_shard', "\s*continent\s*=\s*.*?(Africa|South America).*?\s*", 20, 0);

view raw .sql hosted with ❤ by GitHub

-- Query Rule for Europe ‍

INSERT INTO mysql_query_rules (rule_id, active, username, match_pattern, destination_hostgroup, apply) VALUES (30, 1, 'user_shard', "\s*continent\s*=\s*.*?(Europe).*?\s*", 30, 0);

view raw .sql hosted with ❤ by GitHub

Step 3: Apply and Save Query Rules

After adding the query rules, ensure they take effect by running the following commands:‍

	LOAD MYSQL QUERY RULES TO RUNTIME;
	SAVE MYSQL QUERY RULES TO DISK;

view raw .sql hosted with ❤ by GitHub

Now, access the MySQL server using the ProxySQL port and execute queries:

mysql -u user_shard -ppass123 -h 127.0.0.1 -P 6033

view raw .sh hosted with ❤ by GitHub

use world;

view raw .sql hosted with ❤ by GitHub

-- Example Queries:

Select * from countries where id = 1 and continent = "Asia";

view raw .sql hosted with ❤ by GitHub

-- This will return id=1, name=India, continent=Asia

Select * from countries where id = 1 and continent = "Africa";

view raw .sql hosted with ❤ by GitHub

-- This will return id=1, name=Kenya, continent=Africa.

Select * from countries where id = 1 and continent = "Africa";

view raw .sql hosted with ❤ by GitHub

Conclusion:

By following this step-by-step guide, users can quickly set up ProxySQL and leverage its capabilities to optimize database performance and achieve efficient data distribution.

References:

Download and Install ProxySQL - ProxySQL

How to configure ProxySQL for the first time - ProxySQL

Admin Variables - ProxySQL‍

About the Author

Did you like the blog? If yes, we're sure you'll also like to work with the people who write them - our best-in-class engineering team.

We're looking for talented developers who are passionate about new emerging technologies. If that's you, get in touch with us.

Explore current openings

Subscribe to get the latest technology updates

Simplifying MySQL Sharding with ProxySQL: A Step-by-Step Guide

Vivek Thakkar

Introduction:

What is Sharding?

Understanding the Need for ProxySQL:

‍Installation & Setup:‍

Inside ProxySQL Container:

IMPORTANT:

Conclusion:

References:

MORE POSTS BY THIS AUTHOR

Vivek Thakkar

You may also like

Linux Internals of Kubernetes Networking

Shiwam Jaiswal

Strategies for Cost Optimization Across Amazon EKS Clusters

Saurabh Taneja

Mastering Prow: A Guide to Developing Your Own Plugin for Kubernetes CI/CD Workflow

Bhavya Jain

Simplifying MySQL Sharding with ProxySQL: A Step-by-Step Guide

Introduction:

What is Sharding?

Understanding the Need for ProxySQL:

‍Installation & Setup:‍

Inside ProxySQL Container:

IMPORTANT:

Conclusion:

References:

About the Author

Did you like the blog? If yes, we're sure you'll also like to work with the people who write them - our best-in-class engineering team.

We're looking for talented developers who are passionate about new emerging technologies. If that's you, get in touch with us.

About Velotio

Subscribe to get the latest technology updates

Related Posts

Linux Internals of Kubernetes Networking

Strategies for Cost Optimization Across Amazon EKS Clusters

Mastering Prow: A Guide to Developing Your Own Plugin for Kubernetes CI/CD Workflow

Streamline Kubernetes Storage Upgrades

Unlocking Key Insights in NATS Development: My Journey from Novice to Expert - Part 1

Unveiling the Magic of Kubernetes: Exploring Pod Priority, Priority Classes, and Pod Preemption

How to deploy GitHub Actions Self-Hosted Runners on Kubernetes

How to Setup HashiCorp Vault HA Cluster with Integrated Storage (Raft)

How To Get Started With Logging On Kubernetes?

Create CI/CD Pipeline in GitLab in under 10 mins

Acquiring Temporary AWS Credentials with Browser Navigated Authentication

How to Avoid Screwing Up CI/CD: Best Practices for DevOps Team

How to Make Your Terminal More Productive with Z-Shell (ZSH)

Setting Up A Robust Authentication Environment For OpenSSH Using QR Code PAM

Hacking Your Way Around AWS IAM Roles

Monitoring a Docker Container with Elasticsearch, Kibana, and Metricbeat

Autoscaling in Kubernetes using HPA and VPA

Managing a TLS Certificate for Kubernetes Admission Webhook

Prow + Kubernetes - A Perfect Combination To Execute CI/CD At Scale

Building A Containerized Microservice in Golang: A Step-by-step Guide

Kubernetes Migration: How To Move Data Freely Across Clusters

OPA On Kubernetes: An Introduction For Beginners

To Go Serverless Or Not Is The Question

Ensure Continuous Delivery On Kubernetes With GitOps’ Argo CD

How To Implement Chaos Engineering For Microservices Using Istio

Helm 3: A More Secured and Simpler Kubernetes Package Manager

An Introduction To Cloudflare Workers And Cloudflare KV store

Getting Started With Kubernetes Operators (Golang Based) - Part 3

Getting Started With Kubernetes Operators (Ansible Based) - Part 2

Getting Started With Kubernetes Operators (Helm Based) - Part 1

How to Write Jenkinsfile for Angular and .Net Based Applications

Kubernetes CSI in Action: Explained with Features and Use Cases

A Comprehensive Tutorial to Implementing OpenTracing With Jaeger

The Ultimate Guide to Disaster Recovery for Your Kubernetes Clusters

Know Everything About Spinnaker & How to Deploy Using Kubernetes Engine

Mesosphere DC/OS Masterclass : Tips and Tricks to Make Life Easier

Managing Secrets Using AWS Systems Manager Parameter Store and IAM Roles

Taking Amazon's Elastic Kubernetes Service for a Spin

Extending Kubernetes APIs with Custom Resource Definitions (CRDs)

Jenkins X - A Cloud-native Approach to CI/CD

Demystifying High Availability in Kubernetes Using Kubeadm

Exploring Upgrade Strategies for Stateful Sets in Kubernetes

Learn How to Quickly Setup Istio Using GKE and its Applications

Continuous Deployment with Azure Kubernetes Service, Azure Container Registry & Jenkins

Tutorial: Developing Complex Plugins for Jenkins