🔥 Welcome! My name is Xuming Wang. I'm @MarconXM
* Gopher/data engineer / Machine Learning application / Math / Philosophy / Technical Writer.
💼 Data Engineer @ Anduril Partners.
📚 Reading more about algorithms and how the computer works.
💻 With 4 years' computer science and technology education and 5 years' development working experience.
⛵ Encouraging people for open source collaborations.

Skill Set :
💻 Tech Skills:
• Programming Language: Python, Golang, C++ and SQL
• Data Engineering: Cassandra, Postgre SQL, AWS RDS, S3, and AirFlow.
• DevOps: AWS Cloudformation, Jenkins, Circleci, Docker, and Kubernetes.
• Full Stack: JavaScript, Python, Flask, MySQL.
• Machine Learning: Scikit-Learn, Pytorch, TF, SPSS, and SAS.

🎧 Soft Skills:
• Leadership and Project Management
• Communication and Negotiation
• Teaching

â—¦ Data Engineer @ Anduril Partners :
1. Created Windows virtual machines using AWS to analyze financial data using Spotfire data analytics software.
2. Deployed a private Jupyter Notebook platform with AWS Sagemaker and IAM Roles to prevent data theft and provide technical guidance to Uc Berkeley students.
3. Ultilized Python to consolidate and clean up data, and datetime packets to calculate the stock price trends of various companies by year, quarter, month and day.


â—¦ Data Analyst @ Pantos USA :
1. Implement ETL pipeline and relational database, provide cleaned OLAP data and daily reports.
2. Improve data renew frequency and efficiency, developed web-based reporting dashboard.


â—¦ Research Assistant - Object Detection in NYU Langone Medical Center Project:
1. Achieved a 20% reduction in metric RMSE, by building and training machine learning algorithms for object detection and Localization. Localized the damaged pixels in MRI pictures, improving the MRI image quality and diagnosing results.
2. Deployed the End to End solution independently school server, improved model inference time 70% by TensorRT (flask, uWSGI, Apache, HTML, CSS )
3. Link Here .

Accomplishments

Here are accomlishments I have been done in Data Science field including
Data Modeling, Data ETL, Machine Learning Modeling, DevOps and API Deployment etc

Coffee Shop Full Stack

• Built the backend for a coffee shop web application and created API permission and authentication used role-based access management strategies to control different types of user behavior in the app. (PostgreSQL, Auth0, JWT, Python )
• Deployed scalable containerized machine learning web application in the cluster (Flask, Docker, Kubernetes, and Ansible)

Million Songs Dataset in Data Lake

• Designed data modeling and transforming data from various sources into star schema optimized for analysis (PostgreSQL)
• Developed the ETL Pipeline copying datasets from raw data to database, data processing and writing the result to S3 buckets using efficient partitioning and parquet formatting on Cluster(EMR, redshift, S3, spark)

Machine Learning Microservice API

• Designed and implemented multibranch CI/CD pipeline with Jenkins, CircleCI and Github, performingenvironment setting, file linting and unit testing.
• Configured Kubernetes, created a Kubernetes cluster and Deployed containerized application using Docker and make a prediction with Flask framework.

Patients Readmitted Rate Prediction

Pre-processed data with 100000 records of diabetes patients by conducting EDA, data cleaning and implementing appropriate feature encodings with Pandas, Numpy and Scikit-learn libraries.

Experience

-- Teaching Assistant in CS Deep Learning
I believe in the Feynman Learning Technique: The Best Way to Learn Anything is to Teaching it

  • Reviewed all the student's code
  • Drafted homeworks/ final project
  • Read books and related papers
  • Answered DL/ML questiones
  • Implemented ideas by code
  • Privided/Shared SOTA content

Financial Data Analyst @ Wright Star LLC

Oct/19- Dec/19 New York, NY

Designed auto data updating script for data ETF and stock mode. Improved the overall efficiency by enabling the models to update the daily closing price of stocks(Python, AirFlow)
Built DCF model to classifies stocks based on multiple financial indicators, and provide insights to traders in long-term stock investments(Python, Excel)

Operation Analyst @ DHL-SinoTrans International Air Courier LTD

Jan/17-Aug/18 Dongguan, China

Updated data identified root causes and developed potential solutions, Improved the overall customer satisfaction score by 15% by using root cause analysis, and delivered an actionable solution plan(Python and Excel.)
Communicating and leading a group with Sales, Tech, and service center, draw the actionable solution that can improve our KPI in operation efficiency improvement project.

Bio

My name is Xuming Wang. I am a full-stack data scientist (engineer), who is passionate about applying state-of-the-art data mining and machine learning technology to solve challenging large-scale real-world problems.
I am a graduate student at Fordham University Gabelli School of Business MS in Business Analytics and graduated last Dec, familiar with statistics, data structure, and algorithms. With three years of experience as a data analyst at Poly and DHL, I gained more hands-on involvement from data science jobs with a background in Logistics and solid foundations in Finance. I contribute a sound quantitative best practice to perform large-scale data modeling, data analysis with effective statistical models and deployment of reporting dashboards interface that tracks critical business metrics, identifies actionable insights and influence the direction of the business by effectively communicating results to cross-functional groups.
I am currently working as a research assistant, focusing on the machine learning model deployment and system design of the computation cluster. If you are interested in chatting about data science, I would be happy to connect with you on LinkedIn or e-mail me at wang423@fordham.edu.

CV

Here is my latest CV.