Major Highlights
UI/UX Improvements:
Major updates to the frontend UI for a smoother, more intuitive user experience.
Improved filtering, pagination, and event handling for course and certification management.
Enhanced accessibility and responsiveness across devices.
Course Index & Search:
The course search index (powered by Meilisearch) is now production-ready.
Bulk upload and ingestion scripts for course and certification data have been improved for reliability and speed.
Search and filtering are now faster and more accurate.
Centralized Logging Overhaul:
All server logs (FastAPI, Meilisearch, ClickHouse) are now forwarded to a single, secure FastAPI host using rsyslog over UDP. This enables unified monitoring, easier troubleshooting, and better compliance.
Robust Log Rotation & Retention:
Logrotate is now configured for all critical logs, including remote logs, nginx, and fail2ban. This prevents disk overuse and ensures historical logs are available for 14 days (remote logs) and 4 weeks (fail2ban).
Automated Monitoring & Alerting:
Custom scripts and systemd timers now monitor disk usage, log file sizes, logrotate health, and rsyslog service status. Email alerts are sent proactively to the admin team for any detected issues.
Security & Maintenance:
Kernel and package upgrades applied (now running kernel 6.11.0-26-generic).
All services (nginx, rsyslog, fail2ban, etc.) confirmed healthy post-reboot.
Fail2ban jails for SSH and FastAPI/nginx endpoints are active and protecting the stack.
Bug Fixes:
Resolved duplicate log rotation and log entry issues.
Fixed UI bugs related to filter logic, pagination, and event listeners.
Addressed backend ingestion errors and improved error handling for bulk uploads.
General stability and performance improvements across the stack.
Documentation:
The infrastructure runbook has been updated to reflect all new logging, monitoring, and alerting procedures, as well as UI and search index changes.
How This Benefits You
User Experience: Faster, more intuitive, and more robust course and certification management.
Search: Finding CPE will never be more easy and enjoyable.
Reliability: Proactive monitoring and alerting means issues are caught before they impact users.
Security: Up-to-date kernel and active fail2ban protection.
Maintainability: Centralized logs, improved UI, and clear documentation make troubleshooting and audits a breeze.
Thank you to everyone who contributed to this release!