Loading...

Portfolio

AUTOMATION
TIDYVERSE
REPRODUCIBLE DATA

Project: Automating IRS Tax Compliance with R & Tidyverse

This project demonstrates the engineering of a reproducible data pipeline designed to automate IRS tax compliance for high-volume collectible sales. By leveraging the R Tidyverse and LaTeX, the system transforms raw, disparately formatted marketplace exports into professional, audit-ready Form 8949 statements. The workflow incorporates advanced data sanitization techniques—including cryptographic hashing and price obfuscation—to ensure privacy for public release without compromising the integrity of the analysis. This “non-destructive” architectural approach replaces manual spreadsheet entry with a single-command solution, providing a scalable and transparent framework for managing complex capital gains reporting.

Learn More
AUTOMATION
TIDYVERSE
REPRODUCIBLE DATA

R Project: Transform Amazon Order History for FSA Reimbursements

This project demonstrates the use of data wrangling skills to convert personal order history data into a format that can be searched for purchases that meet IRS FSA eligibility criteria. The project will leverage the R Tidyverse package to convert the raw order history data into tidy format. The data will be queried to yield the desired order history relevant to FSA eligible purchases. The order history is privacy sensitive, so we will need to sanitize the data to generalize dates, mask personal information, and obfuscate price data, while permitting the end user to utilize the output for follow on tasks.

Learn More