> Anand - portfolio

HI THERE

I’m Anand Vidvat,

Data enthusiast +
an Engineer.

  • 3+ Experience
  • 55 Analytical Projects

About Me

As a data enthusiast, I am passionate about identifing patterns in data and levergaing them to pushing the boundaries for business growth.

My mission is build impactful solutions that solve real-world problems and add value to business growth.

I am currently Interning as a Data Analyst at UF Information Technology, where I develop and visualize data-driven solutions that enhance customer satisfaction and reporting efficiency of the IT services. I use Power BI, Python, Salesforce SDK, and SOQL to create dashboards and ETL pipelines that integrate and analyze data from various sources.

I have a strong background in data engineering, having worked at HP for three years. I designed, developed, and deployed over sixty data pipelines using technologies such as Apache Spark, Python, SQL, and Delta Lake in AWS Cloud. I also built a Data Management Framework that simplified data transformation and reduced development time by 70%. I collaborated with product managers, business analysts, and data scientists to prototype data products and machine learning models while complying with data privacy policies

Programing Languages

  • Python
  • R
  • JavaScript
  • Typescript
  • Java
  • Scala
  • C++
  • Shell Scripting
  • HTML
  • Node.js

Framework

Databases & Server

  • MySQL
  • AWS Redshift
  • PostgreSQL
  • MongoDB
  • Oracle Server
  • MS SQL Server
  • Vertica
  • GCP BigQuery
  • Snowflake
  • SFTP Server
  • FTP Server

Tools & Technologies

  • AWS (EC2, RDS, Redshift, Lambda, S3, IAM, SES)
  • Microsoft Azure (App Service, Blob Storage, Azure DevOps, Data Factory)
  • Docker
  • Kubernetes
  • Terraform
  • Node.js
  • Redis
  • Microsoft Excel
  • Microsoft Power BI
  • Delta Lake
  • Git
  • Elastic Search, Logstash, Kibana
  • Splunk
  • Jenkins
  • Tableau
  • Looker
  • Github
  • Gitlab

Work Experience

05/2023 - Present

UF Information Technology

Data Analyst

Developing an ETL pipeline that integrates Salesforce Case Data with Qualtrics Customer Survey Data. Additionally, I created a Power BI dashboard that leveraged the transformed data to report month-on-month growth in customer satisfaction.

  • Leveraged Text Analytics to identify and classify duplicate Cases across CRM applications Divisions in the Salesforce CRM application
  • this impacted over 900 cases per day and reduced overhead by 40% for support teams.
10/2021 - 08/2022

HP Inc

Data Engineer II

worked with product managers, business analysts, and data scientists to prototype data products and machine learning models while complying with data privacy policies such as GDPR.

  • Spearheaded Migration of 55+ Data Pipelines from AWS Public Cloud to AWS VPN with Zero Downtime for end-users
  • Analyzed & enriched Device Telemetry data with enterprise data such as sales, marketing, and subscription to craft customer profile for over 10 million users of HP Instant Ink Service
  • Automated Pipeline Orchestration with Airflow Sensors and Apache Airflow to monitor data lakes in AWS S3 for new updates and boosting Operational Efficiency by 95%
  • Streamlined Data Processing codebase by 70% through standardizing data wrangling framework in PySpark, Delta Lake, and Blue-Green deployment paradigm through Azure DevOps and Terraform with version control in GitHub
  • Developed Data Ingestion services through Linux systems and shell scripting that extract data from databases such as Oracle DB, MS SQL Server, Vertica, and Snowflake and servers such as SFTP Server and FTP Server.
08/2019 - 10/2021

HP Inc

Data Engineer I

As part of Enterprise Data team at Hewlette-Packard, I work on developing data pipelines that provide enrichment and reference data for analytical usage also while building data platforms that support analytical workloads.

  • Built & maintained over 45 Extract, Transform, Load (ETL) pipelines, processing daily around 2 Terabytes of data, with Python, Apache Spark, SQL, and Databricks within the AWS cloud utilizing S3, Redshift, IAM, and EC2 services
  • Drove Adoption & Integration of Apache Airflow with Databricks Jobs API to schedule and monitor Databricks Workflows leading reduced workflow runtime and maintenance time by 40%
  • Developed over 30 Data Ingestion services using Shell Scripts & Crontab to pull data from on-premise sources (Oracle SQL Server, MS SQL Server, Vertica), Snowflake, SFTP, and FTP Servers on Linux systems
  • Created Dashboards using ELK stack to visualize Pipeline Processing and Service Uptime, thus reducing 95% of Manual Efforts for Job Health Monitoring
  • Engineered Continuous Integration/Continuous Delivery (CI/CD) process for integration testing & deployment of software packages leveraging Jenkins, GitHub, and AWS lambda, improving Pipeline Reliability by 50%
01/2019 - 07/2019

HP Inc

Software Engineer Intern

Developed a Node Instrumentation Microservice that monitors and reports resource utilization stats such as CPU usage, Memory usage, Disk, and Network bandwidth utilization in a Linux System. .

  • The Microservice was built using Typescript & JavaScript with Node.js and Redis as a memory store and provides the data through REST APIs for clients to consume..
  • Firmware developers utilise service to test and understand the memory and network bandwidth utilization for their firmware builds on remote clusters.
05/2018 - 06/2018

Centre for Artifical intelligence and Robotics

Machine Learning Engineer Intern

Built a 3D CNN model using the Caffe2 framework over on-premise cluster. The deep learning model was trained on image data (over a million data points) for action recognition through Transfer learning and fine-tuned through the adversarial Learning paradigm.

  • The model achieved an accuracy of 77% on the test data. The model follows Multi Adversarial Domain Adaptation (MADA) Architecture inspired by Generative Adversarial Networks (GANs) and Alexnet.

Education

08/2022 - 05/2024

University of Florida

Warrington College of Business

Master of Science in Information Systems and Operations Management, Specialization in Data Science, GPA : 3.88/4

  • ISOM Director's Award for Academic Excellence, Fall 2023
05/2015 - 06/2019

Vellore Institute of Technology

School of Information Technology and Engineering

Bachelor of Technology in Information Technology, GPA : 3.77/4

Latest works

Analyst on Demand

Built a Question-Answer Bot which is runs on a LLM based model (OpenAI's gpt-3.5) to analyse tesla 10K filing for FY2023

The LLM model uses Retrival Argumentative generation (RAG) to answer imprompt questions

source code