Data Engineer (DIO - Edge) - Edge Data Engineering
MORE ABOUT THIS JOB
The Data Intelligence organization aims to make data a strategic asset for the enterprise by providing a platform that enables the structuring, management, integration, control, discovery, usage, and governance of our Data Assets.
The team leverages a wide variety of cutting-edge technologies, including Hadoop, HBase, Spark, Apache Beam, Apache Flink, Kafka, SQL, OLAP platforms, Presto, Hive, Java and Python. Your impact will be to curate, design and catalog high-quality data models to ensure that data is accessible and reliable; build highly scalable data processing frameworks for use across a wide range of datasets and applications; provide data-driven insight and decision-making critical to GS's business processes, exposing data in a scalable and effective manner; and understand existing and potential data sets in both an engineering and business context.
RESPONSIBILITIES AND QUALIFICATIONS
HOW YOU WILL FULFILL YOUR POTENTIAL
• Deploy modern data management tools to curate our most important data sets, models and processes, while identifying areas for process automation and further efficiencies
• Evaluate, select and acquire new internal & external data sets that contribute to business decision making
• Engineer streaming data processing pipelines
• Drive adoption of Cloud technology for data processing and warehousing
• Engage with data consumers and producers in order to design appropriate models to suit all needs
SKILLS AND EXPERIENCE WE ARE LOOKING FOR
• 2-3 years of relevant work experience in a team-focused environment
• A Bachelor's degree (Master's preferred) in a computational field (Computer Science, Applied Mathematics, Engineering, or a related quantitative discipline)
• Working knowledge of more than one programming language (Python, Java, C++, C#, etc.)
• Extensive knowledge and proven experience applying domain-driven design to build complex business applications
• Deep understanding of the multidimensionality of data, data curation and data quality, including traceability, security, performance, latency and correctness across supply and demand processes
• In-depth knowledge of relational and columnar SQL databases, including database design
• General knowledge of business processes, data flows and the quantitative models that generate or consume data
• Excellent communication skills and the ability to work with subject matter experts to extract critical business concepts
• Independent thinker, willing to engage, challenge or learn
• Ability to stay commercially focused and to always push for quantifiable commercial impact
• Strong work ethic, a sense of ownership and urgency
• Strong analytical and problem-solving skills
• Ability to collaborate effectively across global teams and communicate complex ideas in a simple manner
Preferred Qualifications
• Financial Services industry experience
• Experience with the Hadoop ecosystem (HDFS, Spark)
ABOUT GOLDMAN SACHS
The Goldman Sachs Group, Inc. is a leading global investment banking, securities and investment management firm that provides a wide range of financial services to a substantial and diversified client base that includes corporations, financial institutions, governments and individuals. Founded in 1869, the firm is headquartered in New York and maintains offices in all major financial centers around the world.
© The Goldman Sachs Group, Inc., 2019. All rights reserved. Goldman Sachs is an equal employment/affirmative action employer Female/Minority/Disability/Vet.