Skip to content

🔧 AUTOMATION TOOLS

Balazs, is this something we need and should use? Is it even realistic to expect lab members to use these tools?

Tool 1: Storage Monitor (Prevent crises)

Purpose: Alert before running out of space

Usage:

# Run weekly (automated via cron)
python storage_monitor.py /RDS_Lab_Storage --threshold-warning 80 --threshold-critical 90

Output: - Email alert when >80% full (warning) - Email alert when >90% full (critical) - Weekly report of storage trends

See storage_monitor.py script (provided earlier)


Tool 2: Imaging Compressor (Free up space)

Purpose: Compress old imaging data automatically

Usage:

# Compress all TIFF images older than 90 days
python compress_imaging.py --input /RDS/00_ACTIVE_PROJECTS --days-old 90 --format ome-tiff

Expected results: - 30-50% size reduction - No quality loss - Maintains all metadata - Automatically verifies integrity


Tool 3: Organization Validator (Enforce structure)

Purpose: Check projects follow required structure

Usage:

# Check all active projects weekly
python validate_organization.py /RDS/00_ACTIVE_PROJECTS

Checks: - README.md exists - metadata.yaml exists and valid - File naming conventions followed - Required subdirectories present

Output: - List of non-compliant projects - Specific issues for each project - Email report to Data Steward


Tool 4: Backup Verifier (Ensure backups work)

Purpose: Verify backup integrity automatically

Usage:

# Verify all checksums in backup directory
python verify_backup.py /backup/external_hdd

Checks: - All files have checksums - Checksums match actual file hashes - No corrupted files in backup

Output: - Number of files verified - List of any errors found - Email report if errors detected