INIT: First commit with the script and all necessary files

2025-12-16 13:22:27 +01:00
parent a1fcf8f198
commit c195118e56
8 changed files with 929950 additions and 1 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -3,4 +3,4 @@ venv/
 export/
 *_host_ids.txt
 *.log
-backups/
+partitioning/tests/
--- a/partitioning/70-schema-mysql.txt
+++ b/partitioning/70-schema-mysql.txt
--- a/partitioning/70-schema-postgres.txt
+++ b/partitioning/70-schema-postgres.txt
--- a/partitioning/74-schema-mysql.txt
+++ b/partitioning/74-schema-mysql.txt
--- a/partitioning/74-schema-postgres.txt
+++ b/partitioning/74-schema-postgres.txt
--- a/partitioning/README.md
+++ b/partitioning/README.md
@@ -0,0 +1,164 @@
 # Zabbix Database Partitioning Guide (Python based)
 This guide describes how to set up and manage database partitioning for Zabbix using the `zabbix_partitioning.py` script. 
 ## Overview
 The script manages MySQL table partitions based on time (Range Partitioning on the `clock` column). It automatically:
 1.  Creates future partitions to ensure new data can be written.
 2.  Drops old partitions based on configured retention periods.
 **Benefits**:
 - **Performance**: Faster cleanup of old data (dropping a partition is instantaneous compared to Zabbix internal housekeeping).
 - **Recommended**: For database bigger than 100GB.
 - **Must have!**: For database bigger than 500G.
 > [!WARNING]
 > Support for **MySQL/MariaDB** only.
 > Always **BACKUP** your database before initializing partitioning!
 ---
 ## 1. Prerequisites
 - **Python 3.6+**
 - **Python Libraries**: `pymysql`, `pyyaml`
  ```bash
  # Debian/Ubuntu
  sudo apt install python3-pymysql python3-yaml
  # RHEL/AlmaLinux/Rocky
  sudo dnf install python3-pymysql python3-pyyaml
  # Or via pip
  pip3 install pymysql pyyaml
  ```
 - **Database Permissions**: The user configured in the script needs:
  - `SELECT`, `INSERT`, `CREATE`, `DROP`, `ALTER` on the Zabbix database.
  - `SUPER` or `SESSION_VARIABLES_ADMIN` privilege (required to disable binary logging via `SET SESSION sql_log_bin=0` if `replicate_sql: False`).
 ---
 ## 2. Installation
 1.  Copy the script and config to a precise location (e.g., `/usr/local/bin` or specialized directory).
    ```bash
    mkdir -p /opt/zabbix_partitioning
    cp zabbix_partitioning.py /opt/zabbix_partitioning/
    cp zabbix_partitioning.conf /etc/zabbix/
    chmod +x /opt/zabbix_partitioning/zabbix_partitioning.py
    ```
 ---
 ## 3. Configuration
 Edit `/etc/zabbix/zabbix_partitioning.conf`:
 ```yaml
 database:
    host: localhost
    user: zbx_part
    passwd: YOUR_PASSWORD
    db: zabbix
    # port: 3306  # Optional, default is 3306
 partitions:
    daily:
        - history: 14d
        - history_uint: 14d
        - trends: 365d
        # ... add other tables as needed
 ```
 ### Important Notes:
 - **`replicate_sql`**:
  - `False` (Default): Partitioning maintenance commands are NOT replicated to slaves. Recommended if you manage partitions separately on each node or want to reduce replication lag.
  - `True`: Commands are replicated.
 - **`auditlog`**:
  - In Zabbix 7.0+, the `auditlog` table does **not** have the `clock` column in its Primary Key by default. **Do not** add it to the config unless you have manually altered the table schema.
 ---
 ## 4. Zabbix Preparation (CRITICAL)
 Before partitioning, you **must disable** Zabbix's internal housekeeping for the tables you intend to partition. If you don't, Zabbix will try to delete individual rows while the script tries to drop partitions, causing conflicts.
 1.  Log in to Zabbix Web Interface.
 2.  Go to **Administration** -> **General** -> **Housekeeping**.
 3.  **Uncheck** the following (depending on what you partition):
    - [ ] Enable internal housekeeping for **History**
    - [ ] Enable internal housekeeping for **Trends**
 4.  Click **Update**.
 ---
 ## 5. Initialization
 This step converts existing standard tables into partitioned tables.
 1.  **Dry Run** (Verify what will happen):
    ```bash
    /opt/zabbix_partitioning/zabbix_partitioning.py --init --dry-run
    ```
    *Check the output for any errors.*
 2.  **Execute Initialization**:
    ```bash
    /opt/zabbix_partitioning/zabbix_partitioning.py --init
    ```
    *This may take time depending on table size.*
 ---
 ## 6. Automation (Cron Job)
 Set up a daily cron job to create new partitions and remove old ones.
 1.  Open crontab:
    ```bash
    crontab -e
    ```
 2.  Add the line (run daily at 00:30):
    ```cron
    30 0 * * * /usr/bin/python3 /opt/zabbix_partitioning/zabbix_partitioning.py -c /etc/zabbix/zabbix_partitioning.conf >> /var/log/zabbix_partitioning.log 2>&1
    ```
 ---
 ## 7. Automation (Systemd Timer) — Recommended
 Alternatively, use systemd timers for more robust scheduling and logging.
 1.  **Create Service Unit** (`/etc/systemd/system/zabbix-partitioning.service`):
    ```ini
    [Unit]
    Description=Zabbix Database Partitioning Service
    After=network.target mysql.service
    [Service]
    Type=oneshot
    User=root
    ExecStart=/usr/bin/python3 /opt/zabbix_partitioning/zabbix_partitioning.py -c /etc/zabbix/zabbix_partitioning.conf
    ```
 2.  **Create Timer Unit** (`/etc/systemd/system/zabbix-partitioning.timer`):
    ```ini
    [Unit]
    Description=Run Zabbix Partitioning Daily
    [Timer]
    OnCalendar=*-*-* 00:30:00
    Persistent=true
    [Install]
    WantedBy=timers.target
    ```
 3.  **Enable and Start**:
    ```bash
    systemctl daemon-reload
    systemctl enable --now zabbix-partitioning.timer
    ```
 4.  **View Logs**:
    ```bash
    journalctl -u zabbix-partitioning.service
    ```
 ---
 ## 8. Troubleshooting
 - **Connection Refused**: Check `host`, `port` in config. Ensure MySQL is running.
 - **Access Denied (1227)**: The DB user needs `SUPER` privileges to disable binary logging (`replicate_sql: False`). Either grant the privilege or set `replicate_sql: True` (if replication load is acceptable).
 - **Primary Key Error**: "Primary Key does not include 'clock'". The table cannot be partitioned by range on `clock` without schema changes. Remove it from config.
--- a/partitioning/zabbix_partitioning.conf
+++ b/partitioning/zabbix_partitioning.conf
@@ -0,0 +1,45 @@
 # database: Connection details for the Zabbix database
 database:
    type: mysql
    # host: Database server hostname or IP
    host: localhost
    # socket: Path to the MySQL unix socket (overrides host if set)
    socket: /var/run/mysqlrouter/mysql_rw.sock
    # port: Database port (default: 3306)
    # port: 3306
    # credentials
    user: zbx_part
    passwd: <password>
    db: zabbix
 # partitions: Define retention periods for tables. 
 # Format: table_name: duration (e.g., 14d, 12w, 1m, 1y)
 partitions:
    # daily: Partitions created daily
    daily:
        - history: 14d
        - history_uint: 14d
        - history_str: 14d
        - history_text: 14d
        - history_log: 14d
        - history_bin: 14d
    # weekly: Partitions created weekly
    weekly:
        # - auditlog: 180d
        # Note: auditlog is not partitionable by default in Zabbix 7.0 and 7.4 (PK missing clock).
        # To partition, the Primary Key must be altered to include 'clock'.
        # https://www.zabbix.com/documentation/current/en/manual/appendix/install/auditlog_primary_keys
    # monthly: Partitions created monthly
    monthly:
        - trends: 1y
        - trends_uint: 1y
 # logging: Where to send log output. Options: syslog, console
 logging: syslog
 # premake: Number of partitions to create in advance
 premake: 10
 # replicate_sql: False - Disable binary logging. Partitioning changes are NOT replicated to slaves (use for independent maintenance).
 # replicate_sql: True - Enable binary logging. Partitioning changes ARE replicated to slaves (use for consistent cluster schema).
 replicate_sql: False
--- a/partitioning/zabbix_partitioning.py
+++ b/partitioning/zabbix_partitioning.py
@@ -0,0 +1,508 @@
 #!/usr/bin/env python3
 # -*- coding: utf-8 -*-
 """
 Zabbix Database Partitioning Management Script
 Refactored for Zabbix 7.x compatibility, better maintainability, and standard logging.
 """
 import os
 import sys
 import re
 import argparse
 import pymysql
 from pymysql.constants import CLIENT
 import yaml
 import logging
 import logging.handlers
 from datetime import datetime, timedelta
 from typing import Optional, Dict, List, Any, Union, Tuple
 from contextlib import contextmanager
 # Semantic Versioning
 VERSION = '0.3.0'
 # Constants
 PART_PERIOD_REGEX = r'([0-9]+)(h|d|m|y)'
 PARTITION_TEMPLATE = 'PARTITION %s VALUES LESS THAN (UNIX_TIMESTAMP("%s") div 1) ENGINE = InnoDB'
 # Custom Exceptions
 class ConfigurationError(Exception):
    pass
 class DatabaseError(Exception):
    pass
 class ZabbixPartitioner:
    def __init__(self, config: Dict[str, Any], dry_run: bool = False):
        self.config = config
        self.dry_run = dry_run
        self.conn = None
        self.logger = logging.getLogger('zabbix_partitioning')
        # Unpack database config
        db_conf = self.config['database']
        self.db_host = db_conf.get('host', 'localhost')
        self.db_port = int(db_conf.get('port', 3306))
        self.db_socket = db_conf.get('socket')
        self.db_user = db_conf['user']
        self.db_password = db_conf.get('passwd')
        self.db_name = db_conf['db']
        self.db_ssl = db_conf.get('ssl')
        self.replicate_sql = self.config.get('replicate_sql', False)
    @contextmanager
    def connect_db(self):
        """Context manager for database connection."""
        try:
            connect_args = {
                'user': self.db_user,
                'password': self.db_password,
                'database': self.db_name,
                'port': self.db_port,
                'cursorclass': pymysql.cursors.Cursor,
                # Enable multi-statements if needed, though we usually run single queries
                'client_flag': CLIENT.MULTI_STATEMENTS
            }
            if self.db_socket:
                connect_args['unix_socket'] = self.db_socket
            else:
                connect_args['host'] = self.db_host
            if self.db_ssl:
                connect_args['ssl'] = self.db_ssl
                # PyMySQL SSL options
                # Note: valid ssl keys for PyMySQL are 'ca', 'capath', 'cert', 'key', 'cipher', 'check_hostname'
            self.logger.info(f"Connecting to database: {self.db_name}")
            self.conn = pymysql.connect(**connect_args)
            # Setup session
            with self.conn.cursor() as cursor:
                cursor.execute('SET SESSION wait_timeout = 86400')
                if not self.replicate_sql:
                    cursor.execute('SET SESSION sql_log_bin = 0')
            yield self.conn
        except pymysql.MySQLError as e:
            self.logger.critical(f"Database connection failed: {e}")
            raise DatabaseError(f"Failed to connect to MySQL: {e}")
        finally:
            if self.conn and self.conn.open:
                self.conn.close()
                self.logger.info("Database connection closed")
    def execute_query(self, query: str, params: Optional[Union[List, Tuple]] = None, fetch: str = 'none') -> Any:
        """
        Execute a query.
        fetch: 'none', 'one', 'all'
        """
        if self.dry_run and not query.lower().startswith('select'):
            self.logger.info(f"[DRY-RUN] Query: {query} | Params: {params}")
            return None
        if not self.conn or not self.conn.open:
            raise DatabaseError("Connection not open")
        try:
            with self.conn.cursor() as cursor:
                if self.logger.level == logging.DEBUG:
                     self.logger.debug(f"Query: {query} | Params: {params}")
                cursor.execute(query, params)
                if fetch == 'one':
                    result = cursor.fetchone()
                    # Return first column if it's a single value result and a tuple
                    if result and isinstance(result, tuple) and len(result) == 1:
                        return result[0]
                    return result
                elif fetch == 'all':
                    return cursor.fetchall()
                self.conn.commit()
                return True
        except pymysql.MySQLError as e:
            self.logger.error(f"SQL Error: {e} | Query: {query}")
            raise DatabaseError(f"SQL Execution Error: {e}")
    # --- Utility Functions --- #
    def truncate_date(self, dt: datetime, period: str) -> datetime:
        """Truncate date to the start of the partitioning period."""
        if period == 'hourly':
            return dt.replace(microsecond=0, second=0, minute=0)
        elif period == 'daily':
            return dt.replace(microsecond=0, second=0, minute=0, hour=0)
        elif period == 'weekly':
             # Monday is 0, Sunday is 6. isoweekday() Mon=1, Sun=7.
             # Truncate to Monday
            dt = dt.replace(microsecond=0, second=0, minute=0, hour=0)
            return dt - timedelta(days=dt.isoweekday() - 1)
        elif period == 'monthly':
            return dt.replace(microsecond=0, second=0, minute=0, hour=0, day=1)
        elif period == 'yearly':
            return dt.replace(microsecond=0, second=0, minute=0, hour=0, day=1, month=1)
        else:
             raise ValueError(f"Unknown period: {period}")
    def get_next_date(self, dt: datetime, period: str, amount: int = 1) -> datetime:
        """Add 'amount' periods to the date."""
        if period == 'hourly':
            return dt + timedelta(hours=amount)
        elif period == 'daily':
            return dt + timedelta(days=amount)
        elif period == 'weekly':
            return dt + timedelta(weeks=amount)
        elif period == 'monthly':
             # Simple month addition
             m, y = (dt.month + amount) % 12, dt.year + ((dt.month + amount - 1) // 12)
             if not m: m = 12
             # Handle end of month days (e.g. Jan 31 + 1 month -> Feb 28) logic not strictly needed for 1st of month
             # but keeping robust
             d = min(dt.day, [31, 29 if y%4==0 and (y%100!=0 or y%400==0) else 28, 31, 30, 31, 30, 31, 31, 30, 31, 30, 31][m-1])
             return dt.replace(day=d, month=m, year=y)
        elif period == 'yearly':
            return dt.replace(year=dt.year + amount)
        else:
            return dt
    def get_lookback_date(self, period_str: str) -> datetime:
        """
        Calculate the retention date based on config string (e.g., "30d", "12m").
        """
        match = re.search(PART_PERIOD_REGEX, period_str)
        if not match:
            raise ConfigurationError(f"Invalid period format: {period_str}")
        amount = int(match.group(1))
        unit = match.group(2)
        now = datetime.now()
        if unit in ['h', 'hourly']:
            return now - timedelta(hours=amount)
        elif unit in ['d', 'daily']:
            return now - timedelta(days=amount)
        elif unit in ['w', 'weekly']:
            return now - timedelta(weeks=amount)
        elif unit in ['m', 'monthly']:
            # approximate 30 days per month for simple calculation or full month subtraction
            # using get_next_date with negative amount
            return self.get_next_date(now, 'monthly', -amount)
        elif unit in ['y', 'yearly']:
            return now.replace(year=now.year - amount)
        return now
    def get_partition_name(self, dt: datetime, period: str) -> str:
        if period == 'hourly':
            return dt.strftime('p%Y_%m_%d_%Hh')
        elif period == 'daily':
            return dt.strftime('p%Y_%m_%d')
        elif period == 'weekly':
            return dt.strftime('p%Y_%Uw')
        elif period == 'monthly':
            return dt.strftime('p%Y_%m')
        return "p_unknown"
    def get_partition_description(self, dt: datetime, period: str) -> str:
        """Generate the partition description (Unix Timestamp) for VALUES LESS THAN."""
        # Partition boundary is the START of the NEXT period
        next_dt = self.get_next_date(dt, period, 1)
        if period == 'hourly':
            fmt = '%Y-%m-%d %H:00:00'
        else:
            fmt = '%Y-%m-%d 00:00:00'
        return next_dt.strftime(fmt)
    # --- Core Logic --- #
    def check_compatibility(self):
        """Verify Zabbix version and partitioning support."""
        # 1. Check MySQL Version
        version_str = self.execute_query('SELECT version()', fetch='one')
        if not version_str:
            raise DatabaseError("Could not determine MySQL version")
        # MySQL 8.0+ supports partitioning natively
        # (Assuming MySQL 8+ or MariaDB 10+ for modern Zabbix)
        self.logger.info(f"MySQL Version: {version_str}")
        # 2. Check Zabbix DB Version (optional info)
        try:
            mandatory = self.execute_query('SELECT `mandatory` FROM `dbversion`', fetch='one')
            if mandatory:
                 self.logger.info(f"Zabbix DB Mandatory Version: {mandatory}")
        except Exception:
             self.logger.warning("Could not read 'dbversion' table. Is this a Zabbix DB?")
    def get_table_min_clock(self, table: str) -> Optional[datetime]:
        ts = self.execute_query(f"SELECT MIN(`clock`) FROM `{table}`", fetch='one')
        return datetime.fromtimestamp(int(ts)) if ts else None
    def get_existing_partitions(self, table: str) -> List[Tuple[str, int]]:
        """Return list of (partition_name, description_timestamp)."""
        query = """
            SELECT `partition_name`, `partition_description`
            FROM `information_schema`.`partitions`
            WHERE `table_schema` = %s AND `table_name` = %s AND `partition_name` IS NOT NULL
            ORDER BY `partition_description` ASC
        """
        rows = self.execute_query(query, (self.db_name, table), fetch='all')
        if not rows:
            return []
        partitions = []
        for row in rows:
            name, desc = row
            # 'desc' is a string or int depending on DB driver, usually unix timestamp for TIMESTAMP partitions
            try:
                partitions.append((name, int(desc)))
            except (ValueError, TypeError):
                pass # MAXVALUE or invalid
        return partitions
    def has_incompatible_primary_key(self, table: str) -> bool:
        """
        Returns True if the table has a Primary Key that DOES NOT include the 'clock' column.
        Partitioning requires the partition column to be part of the Primary/Unique key.
        """
        # 1. Check if PK exists
        pk_exists = self.execute_query(
            """SELECT COUNT(*) FROM `information_schema`.`table_constraints` 
               WHERE `constraint_type` = 'PRIMARY KEY' 
               AND `table_schema` = %s AND `table_name` = %s""",
            (self.db_name, table), fetch='one'
        )
        if not pk_exists:
            # No PK means no restriction on partitioning
            return False
        # 2. Check if 'clock' is in the PK
        clock_in_pk = self.execute_query(
            """SELECT COUNT(*) FROM `information_schema`.`key_column_usage` k
               JOIN `information_schema`.`table_constraints` t USING(`constraint_name`, `table_schema`, `table_name`)
               WHERE t.`constraint_type` = 'PRIMARY KEY' 
               AND t.`table_schema` = %s AND t.`table_name` = %s AND k.`column_name` = 'clock'""",
            (self.db_name, table), fetch='one'
        )
        return not bool(clock_in_pk)
    def create_future_partitions(self, table: str, period: str, premake_count: int):
        """Create partitions for the future."""
        # Determine start date
        # If table is partitioned, start from the latest partition
        # If not, start from NOW (or min clock if we were doing initial load, but usually NOW for future)
        top_partition_ts = self.execute_query(
            """SELECT MAX(`partition_description`) FROM `information_schema`.`partitions`
               WHERE `table_schema` = %s AND `table_name` = %s AND `partition_name` IS NOT NULL""",
            (self.db_name, table), fetch='one'
        )
        curr_time = self.truncate_date(datetime.now(), period)
        if top_partition_ts:
            start_dt = datetime.fromtimestamp(int(top_partition_ts))
            # Start from the period AFTER the last existing one
            # Actually, MAX(description) is the *end* of the last partition. 
            # e.g. p2023_10_01 VALUES LESS THAN (Oct 2)
            # So start_dt is Oct 2.
        else:
            # No partitions? Should be handled by init, but fallback to NOW
            start_dt = self.truncate_date(datetime.now(), period)
        # Create 'premake_count' partitions ahead of NOW
        # But we must ensure we cover the gap if the last partition is old
        # So we ensure we have partitions up to NOW + premake * period
        target_max_date = self.get_next_date(curr_time, period, premake_count)
        current_planning_dt = start_dt
        new_partitions = {}
        while current_planning_dt < target_max_date:
            part_name = self.get_partition_name(current_planning_dt, period)
            part_desc = self.get_partition_description(current_planning_dt, period)
            new_partitions[part_name] = part_desc
            current_planning_dt = self.get_next_date(current_planning_dt, period, 1)
        if not new_partitions:
            return
        # Generate ADD PARTITION query
        parts_sql = []
        for name, timestamp_expr in sorted(new_partitions.items()):
            parts_sql.append(PARTITION_TEMPLATE % (name, timestamp_expr))
        query = f"ALTER TABLE `{table}` ADD PARTITION (\n" + ",\n".join(parts_sql) + "\n)"
        self.logger.info(f"Adding {len(new_partitions)} partitions to {table}")
        self.execute_query(query)
    def remove_old_partitions(self, table: str, retention_str: str):
        """Drop partitions older than retention period."""
        cutoff_date = self.get_lookback_date(retention_str)
        cutoff_ts = int(cutoff_date.timestamp())
        existing = self.get_existing_partitions(table)
        to_drop = []
        for name, desc_ts in existing:
            # Drop if the *upper bound* of the partition is still older than cutoff?
            # Or if it contains ONLY data older than cutoff?
            # VALUES LESS THAN (desc_ts). 
            # If desc_ts <= cutoff_ts, then ALL data in partition is < cutoff. Safe to drop.
            if desc_ts <= cutoff_ts:
                to_drop.append(name)
        if not to_drop:
            return
        self.logger.info(f"Dropping {len(to_drop)} old partitions from {table} (Retain: {retention_str})")
        for name in to_drop:
            self.execute_query(f"ALTER TABLE `{table}` DROP PARTITION {name}")
    def initialize_partitioning(self, table: str, period: str, premake: int):
        """Initial partitioning for a table (convert regular table to partitioned)."""
        self.logger.info(f"Initializing partitioning for {table}")
        if self.has_incompatible_primary_key(table):
             self.logger.error(f"Cannot partition {table}: Primary Key does not include 'clock' column.")
             return
        # If already partitioned, skip
        if self.get_existing_partitions(table):
             self.logger.info(f"Table {table} is already partitioned.")
             return
        # Check for data
        min_clock = self.get_table_min_clock(table)
        if not min_clock:
            # Empty table. Start from NOW
            start_dt = self.truncate_date(datetime.now(), period)
        else:
             # Table has data. 
             # For a safe migration, we usually create a catch-all for old data (p_old) or just start partitions covering existing data.
             # This script's strategy: Create partitions starting from min_clock.
             start_dt = self.truncate_date(min_clock, period)
        # Build list of partitions from start_dt up to NOW + premake
        target_dt = self.get_next_date(self.truncate_date(datetime.now(), period), period, premake)
        curr = start_dt
        partitions_def = {}
        while curr < target_dt:
            name = self.get_partition_name(curr, period)
            desc = self.get_partition_description(curr, period)
            partitions_def[name] = desc
            curr = self.get_next_date(curr, period, 1)
        parts_sql = []
        for name, timestamp_expr in sorted(partitions_def.items()):
            parts_sql.append(PARTITION_TEMPLATE % (name, timestamp_expr))
        query = f"ALTER TABLE `{table}` PARTITION BY RANGE (`clock`) (\n" + ",\n".join(parts_sql) + "\n)"
        self.logger.info(f"Applying initial partitioning to {table} ({len(partitions_def)} partitions)")
        self.execute_query(query)
    def run(self, mode: str):
        """Main execution loop."""
        with self.connect_db():
            self.check_compatibility()
            partitions_conf = self.config.get('partitions', {})
            premake = self.config.get('premake', 10)
            if mode == 'delete':
                self.logger.warning("Delete Mode: Removing ALL partitioning from configured tables is not fully implemented in refactor yet.")
                # Implement if needed, usually just ALTER TABLE REMOVE PARTITIONING
                return
            for period, tables in partitions_conf.items():
                if not tables:
                    continue
                for item in tables:
                     # Item is dict like {'history': '14d'}
                     table = list(item.keys())[0]
                     retention = item[table]
                     if mode == 'init':
                         self.initialize_partitioning(table, period, premake)
                     else:
                         # Maintenance mode (Add new, remove old)
                         self.create_future_partitions(table, period, premake)
                         self.remove_old_partitions(table, retention)
            # Housekeeping extras
            if mode != 'init' and not self.dry_run:
                # delete_extra_data logic...
                pass # Can add back specific cleanups like `sessions` table if desired
 def setup_logging(config_log_type: str):
    logger = logging.getLogger('zabbix_partitioning')
    logger.setLevel(logging.DEBUG)
    formatter = logging.Formatter('%(asctime)s - %(levelname)s - %(message)s')
    if config_log_type == 'syslog':
        handler = logging.handlers.SysLogHandler(address='/dev/log')
        formatter = logging.Formatter('%(name)s: %(message)s') # Syslog has its own timestamps usually
    else:
        handler = logging.StreamHandler(sys.stdout)
    handler.setFormatter(formatter)
    logger.addHandler(handler)
 def parse_args():
    parser = argparse.ArgumentParser(description='Zabbix Partitioning Manager')
    parser.add_argument('-c', '--config', default='/etc/zabbix/zabbix_partitioning.conf', help='Config file path')
    parser.add_argument('-i', '--init', action='store_true', help='Initialize partitions')
    parser.add_argument('-d', '--delete', action='store_true', help='Remove partitions (Not implemented)')
    parser.add_argument('--dry-run', action='store_true', help='Simulate queries')
    return parser.parse_args()
 def load_config(path):
    if not os.path.exists(path):
        # Fallback to local
        if os.path.exists('zabbix_partitioning.conf'):
            return 'zabbix_partitioning.conf'
        raise ConfigurationError(f"Config file not found: {path}")
    return path
 def main():
    args = parse_args()
    try:
        conf_path = load_config(args.config)
        with open(conf_path, 'r') as f:
            config = yaml.safe_load(f)
        setup_logging(config.get('logging', 'console'))
        logger = logging.getLogger('zabbix_partitioning')
        mode = 'maintain'
        if args.init: mode = 'init'
        elif args.delete: mode = 'delete'
        if args.dry_run:
            logger.info("Starting in DRY-RUN mode")
        app = ZabbixPartitioner(config, dry_run=args.dry_run)
        app.run(mode)
    except Exception as e:
        print(f"Critical Error: {e}", file=sys.stderr)
        sys.exit(1)
 if __name__ == '__main__':
    main()