remote
Database Reliability Engineer - Lawrence Livermore National Laboratory
Software Engineer
The Database Reliability Engineer ensures high‑availability, performance, and scalability of critical data platforms by applying automation, monitoring, and troubleshooting techniques across SQL, PostgreSQL, and MySQL environments on Linux systems.
About the role
Key Responsibilities
- Design, implement, and maintain automated deployment and configuration pipelines for relational databases.
- Monitor database health, performance, and capacity, proactively identifying and resolving issues.
- Develop scripts and tools (Python, Bash) to streamline routine maintenance, backup, and recovery processes.
- Collaborate with application and infrastructure teams to optimize schema design, query performance, and data security.
- Participate in on‑call rotation, providing rapid incident response and root‑cause analysis for database outages.
Requirements
- 3+ years of experience managing production SQL, PostgreSQL, or MySQL databases on Linux platforms.
- Strong scripting skills in Python or Bash for automation and troubleshooting.
- Hands‑on experience with monitoring tools (e.g., Prometheus, Grafana) and alerting systems.
- Knowledge of backup, recovery, replication, and high‑availability architectures.
- Excellent problem‑solving abilities and a collaborative mindset.
Skills
sqlpostgresqlmysqllinuxpython