Geo George
Portfolio

Data Analyst well versed in Python, SQL, Excel, Power BI and AWS Cloud Certified

Works at Guy Carpenter, Dubai

PG Diploma Data Analytics - University of Warwick, B.Tech CS - NIT Calicut

Sentiment Analysis of Tweets using Natural Language Processing (NLP)

Designed and implemented a sentiment classifier to classify tweets as positive, negative or neutral utilizing Linear Support Vector Classifier (SVC); Obtained accuracy of 63% and f1-score of 60%. Tweets initially cleaned using regular expressions to remove hyperlinks, URLs and unnecessary characters. TweetTokenizer utilized as it handles emojis, hashtags and other elements of a tweet well. As sentiment of tweet is the only point of interest, stemming is done instead of lemmatization.

Automated Web scraper to collect dynamic university course data in regular intervals using Python, Selenium and VBA

Developed an automated web scraper for a client that fetches data from a dynamic javascript enabled website containing course data for the University of Windsor and stores it in an Excel file every 20 minutes using Python, Selenium and Visual Basic Advanced (VBA). Created a macro button in Excel to run VBA code and start the Python script. Utilized 'xlwings' library to facilitate data collection into a pandas dataframe and establish connection between Python and Excel.

Dashboard for Bike Shop to monitor KPIs and Analyze Trends using Power BI

Created an interactive Power BI dashboard to track KPIs like Revenue, Profit, and Return rate using Sales, Product, and Returns data of a Bike Shop. Used Power Query to perform Extract, Transform, Load (ETL) processes; created Relational Data Models. Data Analysis Expressions (DAX) used to create calculated columns and measures. Separate dashboard pages with dynamic charts, filters, slicers and maps were utilized to compare regional performance, understand customer behavior, and analyze product-level trends.

HR Analytics Dashboard to track KPIs related to employee headcount using Power BI

Utilized Power BI to craft a dashboard to track Key Performance Indicators (KPIs) related to employee headcount statistics like Current headcount, Budgeted headcount, New hires, Retention and Attrition rate etc. for Analytics and Engineering Departments; Utilized dual-axis clustered column charts to compare actual, budgeted and % variance actual versus budgeted headcount by month

Automated Web Scraper using Python to fetch details of data science based startups in UAE for Expand North Star 2023 and store in Excel for Job Applications

Created an automated webscraper using Python and Selenium to extract Company Name, Description, Website and LinkedIn information of all companies in the data science domain in the United Arab Emirates (UAE) for Expand North Star 2023 to use in the future for job applications. First search keyword-'data' and country - 'United Arab Emirates' is used and all relevant startup details are fetched. Then we repeat the same process for the product sector 'Big Data & Analytics'. All the above data is concatenated into a dataframe and duplicate rows are removed. Finally, the dataframe is stored in an excel sheet with 4 columns - 'Company Name', 'Description', 'Website', 'LinkedIn'.

Human Target Search & Detection using Autonomous Unmanned Aerial Vehicle (UAV) and Deep Learning, 2020 IEEE IACT

Played a crucial role in the development of an autonomous drone capable of identifying and monitoring specified targets with high accuracy. A mounted camera is used to give visual feedback and an on-board processing unit runs image recognition algorithms to identify targets in real time. Communication modules are used for relay of information between the drone and base station allowing for instant feedback. Co- presented the project at the 2020 IEEE IACT conference, showcasing innovative use of deep learning in real-world applications.