Solutions Architect (Data Engineering)

  • Bengaluru
  • Phdata

phData is revolutionizing the data industry. As the premier data services provider specializing in Machine Learning and AI services for data and application modernization, we partner with the premier technology companies across the modern data platform like Snowflake, AWS, Azure, Fivetran, GCP and dbt to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.


phData is a remote-first global company with employees based in the United States, Latin America and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.

  • 4x Snowflake Partner of the Year (2020, 2021, 2022, 2023)
  • Fivetran, dbt, Atlation, Matillion Partner of the Year
  • #1 Partner in Snowflake Advanced Certifications
  • 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, etc)
  • Recognized as an award-winning workplace in US, India and LATAM
  • Inc 5000 Fastest Growing US Companies (2020-2023)


Required Experience:

  • 10+ years as a hands-on Solutions Architect and/or Data Engineer designing and implementing data solutions
  • Team lead, and/or mentorship of other engineers
  • Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
  • Programming expertise in Java, Python and/or Scala
  • Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • Client-facing written and verbal communication skills and experience
  • Create and deliver detailed presentations
  • Detailed solution documentation (e.g. including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
  • 4-year Bachelor's degree in Computer Science or a related field


Prefer any of the following:

  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
  • Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
  • Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
  • Multiple data sources (e.g. queues, relational databases, files, search, API)
  • Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
  • Automated data transformation and data curation: dbt , Spark, Spark streaming, automated pipelines
  • Workflow Management and Orchestration : Airflow, AWS Managed Airflow, Luigi, NiFi


Why phData? We Offer:

  • Remote-First Workplace
  • Medical Insurance for Self & Family
  • Medical Insurance for Parents
  • Term Life & Personal Accident
  • Wellness Allowance
  • Broadband Reimbursement
  • Continuous learning and growth opportunities to enhance your skills and expertise
  • Other benefits include paid certifications, professional development allowance, and bonuses for creating for company-approved content