JOHN DOE

Big Data Solution Architect - Data Engineering & Transformation

Contact Information

+1 (555) 123-4567

john.doe@example.com

Anytown, USA

LinkedIn Profile


Summary

Big Data Cloud Solution Architect with 12+ years of experience designing and implementing robust, scalable, and cost-effective cloud-based big data solutions. Proven ability to lead complex data engineering projects from inception to delivery, driving significant business impact and technological innovation in dynamic environments. Passionate about applying modern technologies to solve challenging data problems and optimize data ecosystems.


Technical Skills

Programming Languages:

Python, Scala, Java, SQL, Bash

Big Data Frameworks:

Apache Spark (PySpark, Spark SQL, Structured Streaming), Apache Kafka (Confluent, SRM, SMM), Hadoop (HDFS, Hive, HBase), Databricks, Snowflake

Cloud Platforms & Services:

Azure (ADLS, Databricks, Synapse Analytics, Event Hubs, Key Vault, Azure DevOps), AWS (S3, EMR, Lambda, Glue), GCP (BigQuery, Dataflow, GCS)

Databases & Warehousing:

MySQL, PostgreSQL, Cosmos DB, Oracle DB, Redshift, Snowflake

Certifications

  • Microsoft Certified: Azure Solutions Architect Expert
  • AWS Certified Solutions Architect - Associate
  • Databricks Certified Data Engineer Associate
  • Snowflake SnowPro Core Certification

Key Achievements

  • Architected and delivered a strategic **data platform modernization** initiative, migrating legacy systems to a cloud-native architecture, resulting in a **35% reduction in operational costs** and a **50% improvement in data processing speed**.
  • Led a cross-functional team to develop a **real-time analytics pipeline** using Spark Streaming and Kafka, enabling instant business intelligence and leading to a **15% increase in actionable insights** for critical decision-making.
  • Designed and implemented a scalable **data governance framework** for a large enterprise, ensuring data quality, security, and compliance across diverse datasets and platforms.
  • Mentored and empowered junior data engineers, cultivating a high-performing team that successfully delivered **multiple complex data projects on time and within budget**.

Work Experience

Principal Data Architect | Global Tech Solutions Inc.

Jan 2021 - Present | Location: Remote | Domain: FinTech

  • Led architectural design and implementation of next-generation data lake and data warehousing solutions on Azure, serving diverse analytics and reporting needs.
  • Optimized Spark workloads and data ingestion processes, achieving **2x performance improvement** for critical batch and streaming jobs.
  • Drove the adoption of data mesh principles, fostering decentralized data ownership and enabling self-service analytics for various business units.

Senior Big Data Engineer | Innovative Data Corp.

May 2017 - Dec 2020 | Location: New York, NY | Domain: E-commerce

  • Developed and maintained ETL pipelines using PySpark and Airflow to process terabytes of customer behavior data.
  • Implemented real-time recommendation engines using Kafka and Spark, enhancing user experience and driving a **7% increase in conversion rates**.
  • Collaborated with data scientists to deploy machine learning models at scale, integrating predictions into operational systems.

Data Engineer | Analytics Innovators

Jul 2014 - Apr 2017 | Location: San Francisco, CA | Domain: SaaS Analytics

  • Built and managed Hadoop clusters and Hive data warehouses for storing and querying large datasets.
  • Designed and automated data quality checks, reducing data errors by **20%**.
  • Contributed to the development of a proprietary data ingestion framework for various external data sources.

Education

Master of Computer Science

State University, City, State | 2013

Bachelor of Technology in Computer Engineering

Technology Institute, City, State | 2011