Data Engineer vacancy at Sayari

Sayari

Posted a year ago

Description

About Sayari:

Sayari is the counterparty and supply chain risk intelligence provider trusted by government agencies, multinational corporations, and financial institutions. Its intuitive network analysis platform surfaces hidden risk through integrated corporate ownership, supply chain, trade transaction and risk intelligence data from over 250 jurisdictions. Sayari is headquartered in Washington, D.C., and its solutions are used by thousands of frontline analysts in over 35 countries.

Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.

Job Description:

Sayari’s flagship product, Sayari Graph, provides instant access to structured business information from billions of corporate, legal, and trade records. As a member of Sayari's data team you will work with the Product and Software Engineering teams to collect data from around the globe, maintain existing data pipelines, and develop new pipelines that power Sayari Graph.

Job Responsibilities:

Write and deploy crawling scripts to collect source data from the web
Write and run data transformers in Scala Spark to standardize bulk data sets
Write and run modules in Python to parse entity references and relationships from source data
Diagnose and fix bugs reported by internal and external users
Analyze and report on internal datasets to answer questions and inform feature work
Work collaboratively on and across a team of engineers using agile principles
Give and receive feedback through code reviews

Skills & Experience:

Professional experience with Python and a JVM language (e.g., Scala)
2+ years of experience designing and maintaining data pipelines
Experience using Apache Spark and Apache Airflow
Experience with SQL and NoSQL databases (e.g., columns stores, graph, etc.)
Experience working on a cloud platform like GCP, AWS, or Azure
Experience working collaboratively with Git
Understanding of Docker/Kubernetes
Interest in learning from and mentoring team members
Experience supporting and working with cross-functional teams in a dynamic environment
Passionate about open source development and innovative technology
Experience working with BI tools like BigQuery and Superset is a plus
Understanding of knowledge graphs is a plus

$100,000 - $125,000 a year

The target base salary for this position is $100,000 - $125,000 USD plus bonus. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.

Benefits:

· Limitless growth and learning opportunities

· A collaborative and positive culture - your team will be as smart and driven as you

· A strong commitment to diversity, equity & inclusion

· Exceedingly generous vacation leave, parental leave, floating holidays, flexible schedule, & other remarkable benefits

· Outstanding competitive compensation & commission package

· Comprehensive family-friendly health benefits, including full healthcare coverage plans, commuter benefits, & 401K matching

Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.

Please mention the word **INSPIRATION** and tag RMzQuODYuMTYyLjEzMw== when applying to show you read the job post completely (#RMzQuODYuMTYyLjEzMw==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Apply for this job