Production Support Engineer (Brisbane, CA)


Responsibilities:

Maintain, monitor, and enhance the production environment and provide 24x7 system support. Design and deploy production and release-related solutions. Support several high-volume websites, web services, and file exchanges with business partners. Work with customers and engineering to swiftly resolve technical issues. Provide sound tactical short-term solutions to mitigate immediate issues, and contribute to a robust strategic long-term architecture.

  • Design and set up the production environment
  • Maintain all servers and services and ensure their uptime 24x7
  • Monitor the health of production and test systems 24x7
  • Respond promptly to production issues and alerts 24x7
  • Work with Engineering to:
    • Deploy new and updated websites and services
    • Provide an initial assessment of production issues and possible workarounds
    • Troubleshoot and resolve production issues
    • Help scope hardware and software requirements
  • Work with Partners to:
    • Identify and resolve issues
    • Discuss and plan integration tasks
  • Provide recommendations on hardware and software
  • Automate system health checks and the overview dashboard
  • Automate monitoring of all the production services
  • Set up production, pre-production, and test systems
  • Work with hardware/software/cloud vendors
  • Meet or exceed partner SLAs

Requirements:

  • BS/MS or equivalent in computer science or electrical engineering
  • 2+ years of hands-on experience administering large production system environments
  • Experience in following, and improving upon, established procedures within a mission-critical environment
  • Proficient in writing scripts
  • Must be extremely comfortable using and navigating within a Linux environment
  • Hands-on experience with Apache, Tomcat, and JBoss
  • Thorough understanding of web technologies and the web technology stack
  • Ability to do high-level debugging and problem analysis by examining logs and running Unix commands
  • Ability to pinpoint problem areas in source code by analysing logs and stack traces
  • Experience with open source products
  • Excellent written and verbal communication skills
  • Ability to articulate a problem succinctly and to recommend both short- and long-term solutions
  • Comfortable operating in a fast-paced environment
  • Ability to collaborate with all levels of the organization
  • Experienced with, and very comfortable using, SSH, tunneling, and SSH key pairs
  • Understands performance issues, scaling options, and performance tuning
  • Understands business continuity, fault-tolerant design, and fail-over architecture
  • Good understanding of system and web security
  • Understands how DNS works
  • Understands load balancing functionality
  • Knowledge of virtualization and AWS / SoftLayer
  • Experience with version control systems (Subversion / Git)
  • Experience with configuration management systems (Ansible / Chef)
  • Experience using Splunk

This is a non-management position.
This is a full-time position.
