Associate Data Engineer
Requisition ID # 144035
Job Category: Information Technology
Job Level: Individual Contributor
Business Unit: Information Technology
Work Type: Hybrid
Job Location: Oakland
Department Overview
Information Systems Technology Services is a unified organization comprised of various departments which collaborate effectively in order to deliver high quality technology solutions.
Position Summary
We are looking for a savvy Data Engineer to join our growing team of analytics experts. In this role you will work as part of cross functional teams, including data scientists, other data engineers, technology experts, and subject matter experts to develop data driven solutions. Successful candidates will be responsible for building, expanding, and optimizing our data, data storage, and data pipeline. This individual will support team members and decision products to ensure that data delivery is reliable and optimized. Qualified candidates will have a unique opportunity to be at the forefront of the utility industry and gain a comprehensive view of the nation’s most advance smart grid. It is the perfect role for someone who would like to continue to build upon their professional experience and help advance PG&E’s sustainability goals.
PG&E is providing the salary range that the company in good faith believes it might pay for this position at the time of the job posting. This compensation range is specific to the locality of the job. The actual salary paid to an individual will be based on multiple factors, including, but not limited to, specific skills, education, licenses or certifications, experience, market value, geographic location, and internal equity. We would not anticipate that the individual hired into this role would land at or near the top half of the range described below, but the decision will be dependent on the facts and circumstances of each case.
A reasonable salary range is:
Bay Area Minimum: $75,000.00
Bay Area Mid-point: $94,000.00
Bay Area Maximum: $113,000.00
&/OR
California Minimum: $71,000.00
California Mid-point: $89,000.00
California Maximum: $107,000.00
This position is hybrid, working from your remote office and your assigned location based on business need.
Responsibilities:
Work under the guidance of senior data engineers on moderate complexity data and analytics-centric problems which have broad impact and require in-depth analysis to obtain desired results.
Plans work to meet mutually agreed to objectives; reviews progress with team members regularly; documents work in a manner that is aligned with best practices
Identifies, designs, and implements internal process improvements including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes.
Resolves application programming analysis problems of broad scope within procedural guidelines.
Actively participates in agile/scrum ceremonies (stand ups, planning, retrospectives, etc.)
Builds and conducts automated and user acceptances tests as part of an iterative development process
Promotes a continuous improvement mindset by engaging in after action reviews and sharing learnings
Broader Data System and Architecture Maturity
Assesses performance of individual data pipeline and broader data systems, suggests and may implement changes as required to meet individual project or enterprise maturity objectives
Conceptualizes and generates infrastructure that allows big data to be accessed and analyzed with verified data quality and metadata is appropriately captured and catalogued
Work with team leadership to continually improve data driven decision making at PG&E via demonstrations, mentoring, disseminating best practices, etc.
Minimum Qualifications:
BA/BS or equivalent experience in appropriate technical discipline (Computer Science, Management Information Systems) or equivalent experience and/or related field of study
1 year of experience with data engineering/ETL ecosystems such as Palantir Foundry, Spark, Informatica
Experience with data engineering and data transformations via a training or apprenticeship program acceptable
Required Skills
Python
Pandas
APIs
Scikit Learn
PySpark or equivalent big data processing framework
Database design fundamentals
Typescript
Web application development
Familiarity with a CI/CD tool
Familiarity with an infrastructure as code tool
Experience writing production-level code
Experience writing health checks, unit tests, integration tests, schema validations
Knowledge of Time Series data set development.
Demonstrated commitment to teamwork and enabling others
Proven ability to translate business desires into technical requirements
Ability to communicate with various stakeholders and leadership
Ability to breakdown an ambiguous problems
Desired Skills
Typescript
Web application development
Familiarity with cloud computing security fundamentals
Experience with the Palantir Foundry platform
Experience working with data scientists and machine learning engineers
Familiarity with model deployment
Front end tools: PowerBi, Tableau