Job Description
Summary
As Senior DevOps Engineer at NocSmart, you will design and build cloud infrastructure solutions and automation to increase scalability, security, and engineering productivity. You will help define the best practices and methodologies for infrastructure setup, upgrade, automation and maintenance while collaborating closely with peers. In addition, you will be a hands-on contributor to the DevOps team in addressing our company cloud infrastructure, CI-CD, observability, security, and automation.
Responsibilities
- Automate infrastructure provisioning (prefer infrastructure-as-code), DevOps, and continuous integration/delivery
- Build tools, processes, scripts, and workflows that help accelerate the engineering team so we can build better products faster
- Build, deploy and upgrade our real-time compute infrastructure
- Design and implement zero-downtime to accomplish highly available service (99.99%)
- Design and implement disaster recovery (DR) between different region Datacenters
- Ensure our systems are continuously monitored and running efficiently
- Engage in capacity planning and demand forecasting, anticipating performance bottlenecks, and scaling the environment as needed
- Troubleshoot production issues, including performance and function issues
- Configure tooling for system scalability, ensure we have the capacity for future growth
- Participate and improve general software development, deployment, and operations
- Mentor other software engineers on DevOps best practices
Education & Skills
- 5+ years working with AWS, deep understanding of AWS concepts, services, and fluency with the AWS APIs/command-line tools.
- Expert level proficiency with Terraform (Infrastructure as Code)
- Expert level proficiency and automation experience with Python (using boto for aws)
- Expert level proficiency with containerization and docker
- Experience with Github
- Experience with building automation scripts and writing Infrastructure as code scripts in python or any other scripting language
- Source Control: git
- Kubernetes experience preferred
- Build: Github Actions, Jenkins or any other standard build tool
- Security: iptables, firewalld, SSH, TLS, IAM, VPC, VPN
- Systems: Linux: CentOS, Amazon Linux
- Monitoring: Cloudwatch, Datadog, Grafana
- AWS Services: Lambda,Fargate, ECS Tasks, S3, DynamoDB, RDS, Route53, VPC, CloudFront, CloudWatch, etc.
Abilities
- Strong automation mindset and a passion for root cause analysis and strategies to mitigate issues
- Excellent problem-solving skills in isolation, triage, and correlation, with strong ownership
- Understanding of how cloud services can be secured, and you can articulate to others how they can better achieve best practices
- Passion for change and doing things differently, with an innovative mindset.
- Able to work under pressure in a fast-paced environment.
- A driven individual who can meet deadline requirements and project deliverables
- Able to support multiple global geographic business regions
- Up to date on the latest industry trends, able to articulate trends and potential clearly and confidently
- Ability and flexibility to work after hours/weekends for application releases/deployments and off-hour production emergencies