Prithvi has a Master's in Data Science from the USA complemented by a Bachelor's in Computer Science from India, He brings a robust academic foundation to his role as a Data Science Leader with a global perspective. Over 5 years of hands-on experience, Prithvi has spearheaded the development, deployment, and management of data applications. His expertise spans the entire data lifecycle, from extraction and transformation to insightful analysis and visualization. With a proven track record in meticulous data validation, He ensures the accuracy and integrity of critical information and is Passionate about leveraging data-driven insights, Prithvi drive's strategic decision-making and foster business growth through innovative approaches.
Developing and managing POC applications that serves as a proof of concept for data solutions at scale. It's a hands-on project where I get to explore new approaches and technologies, and it’s something I get to present to stakeholders to validate our ideas.
Designing and implementing robust data pipelines that handle large amounts of data efficiently. These pipelines are built to process, transform, and move data across various systems, so they need to be reliable and scalable.
Integrating data from various sources into our systems, ensuring it's accurate and consistent. This requires understanding the structure of the data and ensuring it's clean and ready for analysis.
Working with services like AWS S3 for storing data, Redshift for data warehousing, and EC2 for running our compute tasks. This allows us to scale up or down based on our needs and manage data effectively in the cloud.
GitHub Actions to automate much of the work. This means I can set up continuous integration and delivery pipelines to automatically test, build, and deploy code. It’s key for speeding up development and reducing manual errors.
One of my main tasks is to optimize the performance of our data processing workflows. Whether it’s improving how fast data is ingested or reducing costs related to storage or compute, I’m always looking for ways to make our systems more efficient.
I work with data engineers, managers, data scientists, analysts, and product teams to understand their requirements and translate those into technical solutions. Whether it’s providing them with clean data for analysis or creating dashboards, we collaborate closely to ensure the final product meets everyone’s needs.
Given the sensitivity of data, ensuring our solutions are secure and compliant with regulations is critical. I follow best practices for data governance, so we make sure the data is protected, and the organization stays compliant.
Spearheaded black box testing initiatives on the AWS data processing framework, ensuring robust data quality and reliability.
Identified errors and meticulously documented bug issues in JIRA for resolution, streamlining the debugging process.
Closed bug issues after thorough retesting to ensure alignment with business requirements and data integrity.
Championed test automation by devising solutions in PySpark, automating redundant test cases within regression testing on vast datasets, significantly improving data quality and testing efficiency.
Significantly reduced manual testing efforts and enhanced performance by expediting test report delivery using PySpark automation techniques.
Leveraged GitHub workflow actions, scripted YAML Workflow, and automated regression tests with PySpark, mitigating issues and substantially improving performance and deployment processes.
Meticulously created test scenarios, test cases, and test data for development features, ensuring comprehensive test coverage and data accuracy.
Documented and audited test reports during daily sprints and on release days, maintaining transparency and accountability in testing processes.
Managed offshore team members by assigning tasks through JIRA and providing pre-sprint presentations, ensuring seamless implementation of new processes and effective collaboration.
Donned multiple hats – from Developer and Tester to DevSecOps, leveraging tools such as JIRA, GitHub, Python, and PySpark to drive efficiency and innovation in data engineering projects.
Achieved operational excellence and drove test automation solutions that aligned with company standards, enhancing overall data processing and quality assurance frameworks using PySpark.
2018 - 2019
Ensured quality assurance of daily operations at North American Fulfillment Centers, utilizing MS Excel for data analysis and reporting to maintain high operational standards.
Instrumentally provided timely and relevant responses as part of an automation initiative, leveraging Python scripting to automate repetitive tasks and improve response times.
Contributed significantly to the enhancement of tools and features through active participation in BETA testing programs, using data analysis with Pandas to identify areas of improvement.
Actively utilized responses generated during video monitoring activities as crucial training data for AI models, employing Python for data preprocessing and annotation.
Showcased versatility and adaptability by commencing as a Training Associate, contributing significantly to the development and delivery of comprehensive training programs using MS Excel for curriculum tracking and progress analysis.
Observed and managed agile processes to streamline project development and foster cross-functional collaboration in the pursuit of quality excellence.
Showcased versatility and adaptability by commencing as a Training Associate, contributing significantly to the development and delivery of comprehensive training programs
Provided valuable insights and feedback that directly impacted the enhancement of tools and features, highlighting my dedication to maintaining the highest standards of quality and innovation through data-driven approaches using Python and MS Excel.
Databricks, AWS ( S3, Lambda, Glue, Redshift, Eventbridge, Cloudwatch, SNS, Kafka, Airflow), SnowFlake, Pyhton, Scala, SQL, PostgresSQL, GitHub Actions, Terraform, Tableau, PowerBI, Qliksense, GitHub Workflow actions, YAML scripts, GitHub Version Control, JIRA, Confluence, Agile Methodology, Software Life Cycle, Machine Learning, Windows, Mac
Prithvi is a strong candidate for anything. He performs His work diligently with passion. His work is exemplary and instrumental to our team and org.
He has always had an open mind to learning new things and that go forward attitude in any projects sent His way. Great job Prithvi!
Prithvi is an exceptional data engineer with strong ownership, adaptability, and deep technical expertise.
He's highly dependable, collaborative, and consistently exceeds expectations. I highly recommend him for any data engineering role.
Prithvi has been an amazing comrade to all of us excelling in every domain of his work, tackling challenges with technical brilliance.
I have learnt great qualities from him how to persevere through any longing problem without losing hope and how to tackle constant criticism at work.
He is a wonderful individual at heart and I definitely recommend him to be a fruitful addition to whichever organization and team he works in.
It has been an absolute privilege to collaborate with Prithvi, whose exceptional professionalism and expertise consistently stand out.
His solution approach towards any challenge has undoubtedly left an indelible positive mark on our collaborative works.
One of his notable attributes is that he is punctual ,disciplined and always keeps his word. He meets deadlines and always maintains the quality.
He is not only a reliable team player but also a natural leader who inspires confidence in those around him.
I start by deeply understanding your goals, challenges, and context. Together, we define the scope and outcomes to ensure we’re aligned from day one.
I provide a clear plan and timeline, outlining key phases, deliverables, and responsibilities. You’ll always know what to expect and when.
I believe in open, regular communication—whether through check-ins, shared documents, or quick updates. I keep you informed and involved without overwhelming your time.
I focus on delivering high-quality results that meet your needs—and I welcome feedback. At the end of each engagement, I reflect with clients to ensure impact and continuous improvement.