×

See more DevOps and Sysadmin jobs

Back to all jobs

Senior Site Reliability Engineer - Data (REMOTE)

Posted 9 days ago

Discogs is the largest crowd-sourced, community-driven database of recorded music information in the world. Every day, millions of people use the Discogs Marketplace to connect, learn about music, and buy and sell vinyl records, CDs, and cassettes. As Discogs continues to grow, we are looking for bright, dedicated, creative, and highly motivated people to help us realize our mission to serve the music fan in everyone. We are relatively small, so individual contributions can have a large impact. High value is placed on quality, critical thinking, and continuous improvement. Our teams work collaboratively but are distributed geographically and open-source tools are important to who we are and how we work. We value the experiences and skills each team member contributes to helping us serve our music community.

What We Believe In

Discogs’ mission is to serve the music fan in everyone. We represent a diverse and inclusive community, and we are committed to serving our community with innovative and creative solutions. We know that innovation happens best when varying perspectives are embraced and integrated. Our global team reflects our global community.


Discogs is an Equal Opportunity Employer.


Applicants needing accommodation to apply should contact us at 503-597-6340


Discogs does not promote job openings through text messaging. If you receive a text message claiming to offer a position at our company, please disregard it as fraudulent. For a list of our actively open positions and to apply, please visit the official Careers page on our website: https://www.discogs.com/about/careers

If you apply for this role, you will be required to upload a resume, cover letter, and fill out a few questions regarding your application. Once submitted, our hiring team will review your application and contact you if you are selected for an interview. Whether you are successful or not, we will store your application and data in our system for a maximum period of one year from the application date in case another role becomes available that you are suitable for. If you have any questions or concerns about us storing this data and/or the period of time, please contact us at [email protected] and we will respond to you within 30 days.

The Discogs Platform team is focused on several objectives: building and supporting performant, cost-effective, reliable infrastructure; developer experience tooling and mentorship; and creating "golden paths" for organization-wide standards and velocity. As a key member of the Platform team, the Senior Site Reliability Engineer - Data will be working closely with other Discogs engineering squads to develop and optimize scalable, well-planned relational database architectures, drive best practices and stability for our use of Kafka and change data capture, and contribute to the Platform team’s operations.

Location

This is a remote position. Open to candidates located in OR, WA, CA, CO, TX, IL

Compensation

Starting Base Salary Range: $130,000 - $140,000 yearly

What You’ll Accomplish

Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

  • Stewarding Discogs’ data stores as a key subject matter expert
  • Leading efforts on the reliability and design patterns of our Kafka and Kafka Connect implementations
  • Establishing data contracts and clear communication standards between CDC producers and consumers
  • Working closely with engineering squads to refactor and re-architect MySQL database schema and indexing for long-term scalability, performance, and cost effectiveness
  • Mentoring engineering squads on Platform best practices for MySQL, Kafka, and other software development lifecycle areas 
  • Writing documentation and runbooks that contribute to the engineering organization’s knowledge base
  • Working in a containerized, orchestrated environment
  • Contributing to the Platform team’s disciplines of site reliability and operations, supporting both our squads and Platform’s central infrastructure
  • Participating in on-call rotation, responding to incidents, and troubleshooting data and other operations issues

What You’ll Contribute

Minimum Education and Experience

  • A Bachelor's Degree in Computer Science or similar area of focus, or equivalent relevant work experience.
  • 5+ years of experience working with Kafka and relational database management systems (RDBMS).
  • 6+ years experience in Ops, DevOps, Site Reliability, Platform or other systems roles.

Required Skills & Abilities:

  • Relational database schema design, query performance optimization, administration (MySQL, Percona Server, AWS RDS)
  • Kafka: Cluster administration (Strimzi), Kafka Connect (Debezium, JDBC)
  • CI/CD (GitHub Actions)
  • GitOps (ArgoCD)
  • Kubernetes (EKS, Kustomize, Karpenter, administration, application manifests)
  • AWS and cloud development (VPC, EKS, RDS, S3)
  • Observability (Datadog, Sentry)
  • Scripting (Shell, Python)
  • Track record of collaboration and mentorship
  • Excellent written communication and documentation skills
  • Continuous learning
  • Ownership and proactive approach to solving large problems

Preferred:

  • Infrastructure-as-code (Terraform)
  • Elasticsearch (ECK administration, scaling, performance)
  • Python (SQLAlchemy, FastAPI)
  • GraphQL (schema design, Apollo federation)
  • REST API
  • Hashicorp Vault
  • Redis
  • Memcached
  • NoSQL Database
  • Data Lake/Warehouse
  • Data Governance
  • Data Security

The Platform team covers a wide range of technical topics and we'd love to hear about your skills beyond this list!


Apply Now

Let's start your dream job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Remote DevOps and Sysadmin Jobs

Just set your preferences and Job Copilot will do the rest-finding, filtering, and applying while you focus on what matters.

Activate JobCopilot

Discogs Inc Star Icon

View company

Jobs posted: 6

About the job
(This job is GeoLocked)

Tired of Applying to Jobs Manually?

Let JobCopilot do it for you.

No more spreadsheets. No more copy-pasting. Just set your preferences and let your Al copilot search, match, and apply to jobs while you sleep.

  • Applies for jobs that actually match your skills

  • Tailors your resume and cover letter automatically

  • Works 24/7-so you don't have to

Activate JobCopilot

Related Jobs Remote DevOps and Sysadmin jobs