Role responsibilities:
?Design and implement data products and features in collaboration with product owners, data analysts, and business partners using Agile / Scrum methodology
?Contribute to overall architecture, frameworks and patterns for processing and storing large data volumes
?Research, evaluate and utilize new technologies/tools/frameworks centered around high-volume data processing
?Translate product backlog items into engineering designs and logical units of work
?Profile and analyze data for the purpose of designing scalable solutions
?Define and apply appropriate data acquisition and consumption strategies for given technical scenarios
?Design and implement distributed data processing pipelines using tools and languages prevalent in the big data ecosystem
?Build utilities, user defined functions, libraries, and frameworks to better enable data flow patterns
?Implement complex automated routines using workflow orchestration tools
?Work with architecture, engineering leads and other teams to ensure quality solutions are implemented, and engineering best practices are defined and adhered to
?Anticipate, identify and solve issues concerning data management to improve data quality
?Build and incorporate automated unit tests and participate in integration testing efforts
?Utilize and advance continuous integration and deployment frameworks
?Troubleshoot data issues and perform root cause analysis
?Work across teams to resolve operational & performance issues
The following qualifications and technical skills will position you well for this role:
?MS/BS in Computer Science, or related technical discipline
?1+ years of experience in large-scale software development, 2+ years of big data experience
?Strong programming experience, Python or Scala preferred
?Experience designing, estimating and executing for complex software projects
?Extensive experience with Hadoop and related processing frameworks such as Spark, Hive, Storm, etc.
?Experience with RDBMS systems, SQL and SQL Analytical functions
?Experience with workflow orchestration tools like Apache Airflow
?Experience with source code control tools like Github or Bitbucket
职能类别: 软件工程师
联系方式
上班地址:市区
Get email alerts for the latest"JD Big Data Engineer jobs in Shanghai"