ABOUT THE ROLE:
- The Lead Data Engineer is responsible for leading in the designing, building, and maintaining of the systems and infrastructure that enable organizations to collect, store, process, and analyze large volumes of data. This role involves collaborating closely with data scientists, analysts, QA engineers, business process / functional SMEs, and other stakeholders to ensure that data is accessible, reliable, and secure. They are responsible for tasks such as data ingestion, data transformation, data integration, and data pipeline development. Lead Data Engineers also play a crucial part in data governance, ensuring that data is compliant with regulations and policies. This role also includes all aspects of software development and deployment of IT Operations to shorten delivery and time to market for data products and data pipelines and data warehousing. Overall, the incumbent helps organizations leverage their data to gain meaningful insights and make informed decisions.
WHAT THE ROLE WILL DO:
- Lead and manage a team of data engineers, providing technical guidance, mentorship, and support
- Act as a Subject Matter expert for clients and stakeholders, providing updates and addressing any data related technical inquiries or concerns
- Oversee and drive the design, development, and maintenance of efficient and robust data pipelines, ETL processes, and data workflows
- Manage the development and maintenance of data models and schema designs to support data storage and retrieval needs
- Optimize data storage, processing, and retrieval mechanisms for performance and scalability
- Provide technical leadership and guidance in the selection and implementation of data engineering tools and technologies
- Identify and implement best practices and standards for data engineering processes and tools
- Stay up-to-date with emerging technologies and trends in data engineering, evaluating their potential impact on our data infrastructure
- Lead the Collaboration with cross-functional teams to integrate data sources and enable seamless data flow across systems and with data scientists, analysts, and stakeholders to understand data requirements and develop scalable data solutions
- Lead the implementation and maintenance of data governance and security measures to protect sensitive information and to ensure data quality and integrity by implementing effective data validation and cleansing techniques
Education
- Required
- Bachelor's degree in Computer Science, Computer Engineering, or Information Technology or in lieu of a degree, at least 9 years of experience in data engineering.
Skills and Experience
- 6-8+ years of data engineering experience.
- 4-6+ years of experience with SQL including, but not limited to: PostgreSQL, T-SQL, PL/SQL, SNOWSQL
- 4-6+ years of experience with building applications, system integrations, and web services
- 4-6+ years of experience with server-side scripting and programming in a Linux environment (primarily bash shell)
- Experience with common database (Oracle, Snowflake) technologies including performance, tuning, and optimization.
- Experience with data manipulation and on-prem and cloud-based data warehouse ETL tools and processes
- Experience with business processes such as OTC, P2P, PIM to understand context of data usage
- Experience decoupling monolithic databases and structures into more domain and service-driven designs
- Experience in both structured and unstructured data
- Experience with data design and management for search technologies; including flattening, indexing, and maintaining searchable data
- Methods and technologies for data syncing
Technical Skills
- Domain architecture
- Data security
- Automated testing tools
- Version control tools
- Data modeling
|