Lead Azure Data Engineer with Databricks - Empower (remote/US-based)
Chicago, IL, United States
Applications have closed
Hitachi Solutions
Company Description
Company Overview
Hitachi Solutions is a global solutions integrator passionate about designing, developing, and delivering cutting edge cloud solutions to help our clients innovative across their entire business. Our firm develops the business services and technology powering some of the products you use every day – and is closely aligned with Microsoft and other leaders in the cloud computing space.
What sets Hitachi Solutions apart is both our industry focus, and the intellectual property that we bring to our customers. Recognized for our achievements year after year, we strive to be the trusted advisor of large and medium sized enterprises alike – helping them move fast to achieve strategic business initiatives with distinguished engineering, hard work, and compassion. With over 3,000 team members across 14 countries, in our 18 years of focus our company has seen explosive growth and high customer satisfaction. This has allowed us to offer exceptionally compelling salaries, 401k match, family leave, and health benefits. And no – we will not make you come into an office or ask for an inflexible work schedule.
A part of Hitachi, Ltd., our company has a long and rich history of innovation, financial strength, and international presence of one of the world’s largest companies. Since 1910, Hitachi, Ltd. has been a leader in manufacturing innovative products and solutions that support industry and social infrastructure around the globe supported by 303,000 employees in over 100 countries and across 864 companies.
New Product Development and Innovations Team
This position in our company is housed in our New Product Development and Innovations team formed in 2021. Joining this team represents an opportunity to fast-track your career and to work with a team of fun and nerdy colleagues in a disruptive atmosphere: well-funded, focused on hypergrowth, moving quickly, and making mistakes in the furtherance of innovation and sound engineering.
Armed with an existing book of business, and a stable financial parent – it is the goal of this group to help our firm introduce products to enhance our already strong services business – ultimately making the cloud easier for our customers, and allowing us to hit our long term financial goals with greater-than linear scale.
Job Description
LEAD DATA ENGINEER (DATABRICKS, AZURE, PYTHON, SPARK)
This is a full-time role in our product organization for an expert in big data systems design with considerable skill and expertise in data architecture, especially in big data systems (Spark and other EDW technology).
Individuals in this role will assist in the design, development, enhancement, and maintenance of complex data pipelines products that manage business critical operations, and large-scale analytics pipelines. Qualified applicants will have a demonstrated capability to learn new concepts quickly, have a data engineering background, and/or have robust software engineering expertise.
Responsibilities
- Scope and execute together with team leadership. Work with the team to understand platform capabilities and how to best improve and expand those capabilities.
- Strong independence and autonomy.
- Design, development, enhancement, and maintenance of complex data pipeline products which manage business-critical operations and large-scale analytics applications.
- Experience leading mid- and senior engineers.
- Support analytics, data science and/or engineering teams and understand their unique needs and challenges.
- Instill excellence into the processes, methodologies, standards, and technology choices embraced by the team.
- Embrace new concepts quickly to keep up with fast-moving data engineering technology.
- Dedicate time to continuous learning to keep the team appraised of the latest developments in the space.
- Commitment to developing technical maturity across the company.
Please note: Although our position is remote / virtual / work-from-home, you MUST reside, and be authorized to work, in the US or Canada without sponsorship.
Qualifications
- 5+ years of Azure Data Engineering experience including 2+ years designing and building Databricks data pipelines is REQUIRED; experience with conceptual, logical and/or physical database designs is HIGHLY DESIRED
- 2+ years of hands-on Python/Pyspark/SparkSQL experience is REQUIRED
- 2+ years of experience with big data pipelines or DAG Tools (Dbt, Data Factory, Airflow, or similar) is REQUIRED
- 2+ years of Spark experience (especially Databricks Spark and Delta Lake) is REQUIRED
- 2+ years of hands-on experience implementing big-data solutions in the Azure ecosystem including Data Lakes is REQUIRED
- 2+ years of experience with source control (git) on the command line is REQUIRED
- 2+ years of SQL experience, specifically to write complex, highly optimized queries across large volumes of data is HIGHLY DESIRED
- Strong data modeling / data profiling capabilities with Kimball/star schema methodology is HIGHLY DESIRED
- Professional experience with Kafka or other streaming technology is HIGHLY DESIRED
- Professional experience with database deployment pipelines (i.e., dacpac’s or similar technology) is HIGHLY DESIRED
- Professional experience with one or more unit testing or data quality frameworks is HIGHLY DESIRED
#LI-CA1
#REMOTE
#AZURE
#DATABRICKS
#SPARK
Additional Information
We are an equal opportunity employer. All applicants will be considered for employment without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture Azure Big Data Databricks Data pipelines Data quality Engineering Git Kafka Pipelines PySpark Python Spark SQL Streaming Testing
Perks/benefits: 401(k) matching Career development Health care Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Engineer II jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Principal Data Engineer jobs
- Open Business Data Analyst jobs
- Open Data Quality Analyst jobs
- Open Data Manager jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs