βπΎ About Me
Detail-oriented and analytical Data Analyst with a strong background in extracting, cleaning, and analyzing complex datasets in the healthcare sector. Proficient in utilizing advanced statistical methods and data visualization techniques to interpret trends and provide actionable insights. Skilled in database management, SQL querying, and proficiency in Python and business intelligence tools such as Power BI. Experienced in collaborating with cross-functional teams to optimize data-driven decision-making processes and enhance organizational efficiency. Strong problem-solving abilities combined with a meticulous attention to detail to ensure data accuracy and reliability. Here's of what I bring to the table:
π» Skills
β’ SQL (SQL Server, MySQL)
β’ Python (Pandas, NumPy, SciPy)
β’ Microsoft Power BI
β’ Excel (VLookup, Conditional Formatting, Pivot Tables, Macros)
β’ Google Sheets
β’ PowerPoint, Word, Share Point, Outlook
π Projects
IDENTIFYING DUPLICATE CLIENTS β Lumenusβs Project β Toronto, ON October 2023 β Leveraged Python to efficiently analyze and deduplicate extensive datasets comprising tens of thousands of client records. First, employed Python to cleanse the data by eliminating extraneous spaces and commas. Next, implemented a comprehensive comparison strategy by evaluating key client attributes, including names, last names, Date of Birth, addresses,and parental information. Assigned weights ranging from one to three to each metric based on their significance and aggregated these weights to quantify the overall similarity between potential duplicate clients. This approach significantly accelerated the detection of duplicate entries, streamlining the overall process.
DATA CLEANUP - Lumenusβs Projectβ Lumenusβs Project β Toronto, ON February 2024 β In preparation for transitioning to a new database, the organization needed to clean up client data, including email, phone number, address, city, and province. I leveraged Python to write regular expressions that identified and corrected issues such as invalid phone numbers and emails, extracting comments from emails to their correct fields, and separating street names from numbers. This method drastically sped up the cleaning process, enabling the correction of a large volume of data. For minor adjustments and to keep track of changes, I used Excel, which also facilitated the final update to the backend with the cleaned data, streamlining the entire process.