![phData logo](https://static.remoteliz.com/static/companies/company-phdata.io-logo.jpg)
Senior Data Engineer
phData
Job Summary
We are seeking a Data Engineer with at least 4 years of experience in software engineering, data engineering, or data analysis. The role involves taking end-to-end technical solutions into production while ensuring performance, security, scalability, and robust data integration. Core programming expertise in Java, Python, or Scala is required, along with proficiency in cloud data platforms such as Snowflake, AWS, Azure, Databricks, and GCP. Strong SQL skills are essential for writing, debugging, and optimizing queries. Additionally, the candidate should have client-facing communication skills, experience creating detailed presentations, and experience documenting solutions thoroughly. A 4-year bachelor's degree in Computer Science or a related field is required. Preferred qualifications include production experience with core data platforms, cloud and distributed data storage systems, data integration technologies, handling multiple data sources, full software development lifecycle experience, automated data transformation tools like dbt, and workflow management systems such as Airflow or Luigi. phData offers a remote-first workplace, medical insurance for self, family, and parents, term life and personal accident coverage, a wellness allowance, broadband reimbursement, continuous learning opportunities, paid certifications, a professional development allowance, and bonuses for creating company-approved content.
Company Benefits
- ✓Remote-First Workplace
- ✓Medical Insurance for Self & Family
- ✓Medical Insurance for Parents
- ✓Term Life & Personal Accident
- ✓Wellness Allowance
- ✓Broadband Reimbursement
- ✓Continuous Learning and Growth Opportunities
- ✓Paid Certifications
- ✓Professional Development Allowance
- ✓Bonuses for Creating Company-Approved Content
Required Experience:
At least 4 years of experience as a Software Engineer, Data Engineer, or Data Analyst
Ability to develop end-to-end technical solutions into production and to help ensure performance, security, scalability, and robust data integration
Programming expertise in Java, Python and/or Scala
Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
SQL and the ability to write, debug, and optimize SQL queries
Client-facing written and verbal communication skills and experience
Create and deliver detailed presentations
Detailed solution documentation (e.g. POCs and roadmaps, sequence diagrams, class hierarchies, logical system views)
4-year Bachelor's degree in Computer Science or a related field
Prefer any of the following:
Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
Multiple data sources (e.g. queues, relational databases, files, search, API)
Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
Why phData? We Offer:
Remote-First Workplace
Medical Insurance for Self & Family
Medical Insurance for Parents
Term Life & Personal Accident
Wellness Allowance
Broadband Reimbursement
Continuous learning and growth opportunities to enhance your skills and expertise
Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content