Senior DevOps Engineer
Hallmark Labs is a subsidiary of Hallmark Cards, Inc based in Santa Monica, California.
We currently operate two digital subscription services, Hallmark Movies Now and
Hallmark eCards, as well as ongoing initiatives in personalized, print-on-demand
greeting cards. We are a diverse team of innovators, creators and influencers
leveraging Hallmark’s deep experience in creating meaningful connections and
progressing it into the digital age with cutting-edge technology.
Are you great at what you do and passionate about your work? Are you honest and
Are you a self- motivated, go-getter that likes to have fun?
If so, you are in the right place.
Who we are…
Hallmark Labs (a subsidiary of Hallmark), based in Santa Monica, CA, is currently the parent company of two digital subscription services: Hallmark Movies Now and HallmarkeCards.com. And soon we will be launching our Print-On-Demand iOS App, Out Of The Box. We focus on leveraging Hallmark's experience with creating meaningful connections with great sentiments, but pushing it into the digital age at a rapid pace and with the cutting-edge technology.
Systems management and IaaS automation in building, monitoring, maintaining, and alerting Linux systems in AWS to working with QA to ensure their test automation is running correctly on every commit to GitHub. This job is all about automation, IaaS, and uptime. The responsibilities will include change management, access control, addressing any issues that arise that isn't already automated, and work with the Engineering teams to ensure the solution they're deploying are supportable and scalable to support the growing customer base. We love innovation, and support efforts that provide automated systems for the purpose of 99.99% uptime.
This job involves the following responsibilities:
Works closely with other infrastructure, engineering and customer service teams to insure services are available 24 x 7.
Drive technical innovation and efficiency in infrastructure operations via automation.
Design systems management solutions using automation and self-repair rather than relying on alarming and human intervention.
Insure all systems have required security compliance for patch management, anti-virus, and other threat protection
Create processes that enhance operational workflow and provide positive customer impact.
Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation.
Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure prone ones.
Act as a technical point of escalation.
Develop appropriate metrics to demonstrate performance at improving operational efficiency.
Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources.
Must be able to support off-hours on-call.
Problem solving & troubleshooting including performing root cause analysis for preventative analysis.
Work on small, cross-functional, fast paced teams.
Utilize organizational skills and the ability to manage a diversified workload.
Communicate & work effectively with all levels of staff including senior management.
Work under minimal supervision on complex issues to deliver great results on schedule.
- B.S. Degree in Computer Science, Math, or other related fields
5+ years enterprise infrastructure experience
3+ years cloud experience
3+ years Experience with IaaS design and micro-service systems architecture.
3+ years Experience with capacity planning, utilization review, and monitoring of availability and performance.
Held a prior role with responsibility for High Scalability/Availability Systems Architecture, Security, and Systems Support.
Experience with configuration and management of multiple server platforms.
3+ years AWS experience
Experience with automation languages such as Ruby, Python, and Go.
Experience with configuration management tools such as Ansible, Puppet, or Chef.
Experience with continuous integration tools such as Jenkins, Rundeck, Ant, or Maven.
Experience with ELK, Grafana, Zabbix, Cloudwatch, Cloudformation, other open source/cloud ready tools.
Experience in implementing, managing, and refining disaster recovery solutions.
Proficiency in TCP/IP networking, architecture and other core network technologies (DNS, HTTP, Routing, Firewalls, Load Balancers, etc.).
Familiar with both SQL and NoSQL technologies such as MySQL, MongoDB, Redis.
Familiar with Agile processes and DevOps manifesto.
To be considered an applicant for this position you must show how you meet the basic qualifications of the job in a resume or document you upload, or by completing the work experience and education application fields.
In compliance with the Immigration Reform and Control Act of 1986, Hallmark Labs will hire only individuals lawfully authorized to work in the United States. Employment by Hallmark Labs is contingent upon the signing of the Employment Agreement, completing Form I-9 Employment Eligibility Verification and satisfactory reference and background checks.
Excellent medical benefits
401(k) match up to 5%
Life insurance policy for every employee, at no cost
Cell phone and home internet reimbursement
Carpool and parking pass cash-out program
Generous maternity/paternity leave
Employee assistance programs
Fully stocked kitchen with fresh fruit and delicious snacks and beverages
Monthly catered lunches
Soft serve machine 24/7
Great eateries close by, oh, and we’re near the beach!
Massage chairs and bicycles
Onsite free parking
Flexible work hours/work from home
Hallmark Labs is an equal opportunity employer. All qualified applicants will be considered for employment without regard to race, color, religion, sex, age, pregnancy, national origin, physical or mental disability, genetics, sexual orientation, gender identity, veteran status, or any other legally-protected status. Principals only please.