Hi there!
I'm Yiwen Xiang

Nice To Meet You

About Me



image alt text

Yiwen Xiang

Graduate Student | Open to Work


Incoming Graguate Student at Carnegie Mellon University
Master of Science in Software Engineering
Actively seeking 2024 summer software engineer intern opportunities


My Technology Stack

C TypeScript React CSS3 Google Cloud Platform MongoDB NodeJS Express.js Django JavaScript AWS HTML5 Git PyTorch scikit-learn Styled Componenets MySQL


Experiences


  • September 2023 - Present

    Master of Science in Software Engineering

    Carnegie Mellon University
  • June 2023 - Present

    Software Engineer Intern

    Damo Academy
    Alibaba Group, Inc.
    πŸ‘©πŸ»β€πŸ’» Work
    • Working on a cutting-edge project that involved leveraging LangChain, an innovative language modeling framework, to optimize prompt inputs and enhance the performance of the models.
    • Utilizing state-of-the-art LLM based models like Segment- Anything and MiniGPT-4 to tackle the challenging task of annotating and recognizing specific images. Enabling accurate and efficient annotation of these images.
    πŸ§™β€β™€οΈ Tech Stack
      LangChain, Python, LLMs
    VIEW COMPANY
  • April 2023 - June 2023

    Applied Software Engineer Intern

    Cloud product Development Department
    Hitry, Inc.
    πŸ‘©πŸ»β€πŸ’» Work
    • Developed an internal code analysis tool utilizing ChatGPT for the company. The tool analyzes source code, providing a comprehensive summary of the system's overall functionality. It identifies issues such as syntax errors, security risks, and other code-related problems, and offers suggested solutions and recommendations for resolving these issues.
    • The tool was encapsulated using Gradio, enabling the creation of a user-friendly web interface. This interface allows users to interact with the tool via a web service, making it convenient for internal use within the company's environment.
    • Implemented Docker to containerize the application, ensuring consistency and reproducibility across different environments. The Dockerfile was configured to define the container's image, including the necessary dependencies, libraries, and configurations required for the project to run smoothly.
    πŸ§™β€β™€οΈ Tech Stack
      Gradio, Docker, RESTful API, Node.js
    VIEW COMPANY
  • May 2022 - June 2023

    Research Intern

    Purdue University
    πŸ‘©πŸ»β€πŸ’» Work
    • Played a pivotal role in a semi-automatic image annotation tool design. Leveraged object detection and segmentation to perform comprehensive preprocessing and analysis of images, resulting in automated annotation capabilities.
    • Implemented state-of-the-art technologies including PyTorch and Transformer models to achieve robust and precise object recognition in challenging domains such as medical and automotive camera applications. Extracted crucial information about object position, color, and category, enabling comprehensive understanding of the visual content.
    • Ensured seamless integration with frontend interfaces by converting the model outputs into JSON Web Token, facilitating efficient data transmission and utilization in various downstream applications.
    • The co-authored paper "Rapid Image Labeling via Neuro-Symbolic Learning" has been accepted by KDD 2023.
    πŸ§™β€β™€οΈ Tech Stack
      PyTorch, Transformer, JSON
    VIEW School
  • September 2018 - June 2022

    Bachelor of Engineering

    School of Computer Science
    Chongqing University
  • October 2021 - March 2022

    Software Engineer Dev Intern

    Vertical Industry Innovation Quality & Efficiency Department
    Baidu Inc.
    πŸ‘©πŸ»β€πŸ’» Work
    • Developed a Python-based email efficiency system for 100+ users. Integrated GET and POST requests to retrieve bug-related information from RESTful APIs, including discovery dates and fix durations. Stored data in MySQL and analyzed developer efficiency using custom scripts. Notified low-efficiency developers via email written in Markdown.
    • Scripted the simulation of hundreds of athlete and country data entries in MongoDB for thorough functionality testing in the Beijing Winter Olympics App module. Ensured seamless program execution during the event.
    • Led the development of online real-time monitoring tools for the Winter Olympics. Utilized the urllib.request method to retrieve HTML content from web pages, specifically to analyze and scrape live medal standings from the official website. Conducted API response tests with Postman and retrieved standings from competitor APIs. Implemented a comparison mechanism and sent timely reminders through a software bot in case of delayed updates. Automated report generation and scheduling using Crontab on a Linux system.
    • Implemented CI/CD practices and thorough code reviews to ensure code quality and continuous integration of code.
    πŸ§™β€β™€οΈ Tech Stack
      Mongo DB, RESTful APIs, HTML, requests, CI/CD Pipeline, Linux
    VIEW COMPANY
  • To be
    continued
    ...

Projects

Deep Learning Hackathon 1st Place
ByteVid
ByteVid - Deep Learning Hackathon 1st Place
  • Developed for NTU MLDA Deep Learning Week Hackathon in 48 hours (1/10/2022 - 3/10/2022).
  • Achieved 1st place out of 120 teams.
  • Detailed explanation of our solution: https://me.tjh.sg/blog/bytevid

Project Description

Say goodbye to long and boring videos! πŸ‘‹

Powered by the cutting-edge deep learning technologies in 2022, ByteVid transforms long, boring videos into fun byte-sized content.

Be it a one hour long lecture, or a 30-minute zoom meeting, ByteVid can transcribe, summarise the content, extract keywords, detect and extract important slides from the video, and translate into other languages.

Deep Learning

  • Whisper: SOTA speech recognition (Sep 2022)
  • YOLOv7: SOTA object detection (Jul 2022)
  • KBIR-inspec: key phrase extraction (Dec 2021)
  • Bert Extractive Summarizer: summarisation (Jun 2019)
  • BlingFire: sentence extraction
  • Baidu Translate API: translation

Frontend

  • React.js
  • Tailwind CSS
  • Deploy on GitHub pages

Backend

  • Flask server
  • Deploy on a GPU machine
  • Relay to an Internet-facing VPS
  • Nginx reverse proxy
  • Cloudflare protection

Tools

  • OpenCV
  • youtube-dl
  • ffmpeg
TraViS - Transformer Attention Visualiser
TraViS - Transformer Attention Visualiser
TraViS - Transformer Attention Visualiser
  • Developed a mobile-responsive web app that visualises the attention mechanism of Transformer-based model (BERT) in HTML, CSS, JavaScript, and D3.js
  • Executed the BERT model directly in browser (client-side)
  • Received more than 28 stars on the GitHub repository
Word Piece Tokenizer
Word Piece Tokenizer
Word Piece Tokenizer
  • Developed a Python library that implements a modified, lightweight version of HuggingFace BERT Tokenizer in pure Python
  • My tokenizer can maintain high performance in resource-limited devices like embedded systems and web browsers
  • My tokenizer is 57% faster than the original HuggingFace BERT Tokenizer
More

PUBLICATIONS


Rapid Image Labeling via Neuro-Symbolic Learning
Accepted by KDD 2023, the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
COVID-19 Diagnosis with Convolution Neural Networks using CT images
Published on CSAMCS 2021, International Conference on Statistics, Applied Mathematics, and Computing Science