Project :: ECE 445 - Senior Design Laboratory

Project

#	Title	Team Members	TA	Documents	Sponsor
45	AI-based Meeting Transcription Device	Chang Liu Gao Gao Ziyang Huang	Jiankun Yang	design_document1.docx final_paper1.pdf other1.pdf other2.txt photo2.jpg photo3.jpg photo1.jpg presentation1.pptx proposal1.pdf
	## Team Members: - Ziyang Huang (ziyangh3) - Gao Gao (xgao54) - Chang Liu (changl21) ## Problem During the pandemic, we found Zoom’s live transcription very useful, as it helped the audience catch up quickly with the lecturer. In many professional and academic settings, real-time transcription of spoken communication is essential for note-taking. Additionally, individuals with hearing impairments face challenges in following spoken conversations, especially in environments where captions are unavailable. Existing solutions, such as Zoom’s live transcription or mobile speech-to-text apps, require an internet connection and are often tied to specific platforms. To address this, we propose a standalone, portable transcription device that can capture, transcribe, and display spoken text in real time. The device will be helpful since it provides a distraction-free way to record and review conversations without relying on a smartphone or laptop. ## Solution Our Smart Meeting Transcription Device will be a portable, battery-powered device that records with a microphone, converts speech into real-time text, and displays it on an LCD screen. The system consists of the following key components: 1. A microphone module to capture audio input. 2. A speech processing unit (Jetson Nano/Raspberry Pi/Arduino) running the Vosk speech-to-text model to transcribe the captured speech. 3. An STM32 microcontroller, which serves as the central controller for managing user interactions, processing text display, and storing transcriptions. 4. An LCD screen to display transcriptions in real-time. 5. External memory (SD card or NOR flash) for saving transcribed conversations. 6. A power system (battery with efficient power management) to enable portability. --- ## Solution Components ### Subsystem 1: Speech Processing Unit - Function: Captures audio and converts speech into text using an embedded speech-to-text model. - Microphone Module: Adafruit Electret Microphone Amplifier (MAX9814) - Processing Board: Jetson Nano / Raspberry Pi 4B - Speech Recognition Model: Vosk Speech-to-Text Model - Memory Expansion (if required): SD card (SanDisk Ultra 32GB) ### Subsystem 2: STM32 Central Controller - Function: Manages the user interface, processes the transcribed text, and sends data to the LCD screen. - Microcontroller: STM32F4 Series MCU - Interface Components: Buttons for navigation and text saving - Memory Module: SPI-based NOR Flash (W25Q128JV) ### Subsystem 3: Display Module - Function: Displays real-time transcriptions and allows users to scroll through previous text. - LCD Screen: 2.8-inch TFT Display (ILI9341) - Controller Interface: SPI Communication with STM32 ### Subsystem 4: Power Management System - Function: Provides reliable and portable power for all components. - Battery: 3.7V Li-ion Battery (Adafruit 2500mAh) - Power Regulation: TP4056 Li-ion Charger + 5V Boost Converter - Power Optimization: Sleep mode for STM32 to enhance battery life --- ## Criterion for Success 1. The device must accurately transcribe speech to text with reasonable latency. 2. The LCD screen must display real-time transcriptions clearly. 3. The STM32 must successfully manage system operations and communicate with peripheral components. 4. The system should support local storage for saving transcriptions. 5. The battery life should last at least 2-3 hours under normal usage conditions.

Title

Team Members

Documents

Sponsor

AI-based Meeting Transcription Device

Chang Liu
Gao Gao
Ziyang Huang

Jiankun Yang

design_document1.docx
final_paper1.pdf
other1.pdf
other2.txt
photo2.jpg
photo3.jpg
photo1.jpg
presentation1.pptx
proposal1.pdf

## Team Members:
- **Ziyang Huang** (ziyangh3)
- **Gao Gao** (xgao54)
- **Chang Liu** (changl21)

## Problem

During the pandemic, we found Zoom’s live transcription very useful, as it helped the audience catch up quickly with the lecturer. In many professional and academic settings, real-time transcription of spoken communication is essential for note-taking. Additionally, individuals with hearing impairments face challenges in following spoken conversations, especially in environments where captions are unavailable.

Existing solutions, such as Zoom’s live transcription or mobile speech-to-text apps, require an internet connection and are often tied to specific platforms. To address this, we propose a standalone, portable transcription device that can capture, transcribe, and display spoken text in real time. The device will be helpful since it provides a distraction-free way to record and review conversations without relying on a smartphone or laptop.

## Solution

Our **Smart Meeting Transcription Device** will be a portable, battery-powered device that records with a microphone, converts speech into real-time text, and displays it on an LCD screen. The system consists of the following key components:

1. **A microphone module** to capture audio input.
2. **A speech processing unit** (Jetson Nano/Raspberry Pi/Arduino) running the Vosk speech-to-text model to transcribe the captured speech.
3. **An STM32 microcontroller**, which serves as the central controller for managing user interactions, processing text display, and storing transcriptions.
4. **An LCD screen** to display transcriptions in real-time.
5. **External memory** (SD card or NOR flash) for saving transcribed conversations.
6. **A power system** (battery with efficient power management) to enable portability.

---

## Solution Components

### **Subsystem 1: Speech Processing Unit**
- **Function:** Captures audio and converts speech into text using an embedded speech-to-text model.
- **Microphone Module:** Adafruit Electret Microphone Amplifier (MAX9814)
- **Processing Board:** Jetson Nano / Raspberry Pi 4B
- **Speech Recognition Model:** Vosk Speech-to-Text Model
- **Memory Expansion (if required):** SD card (SanDisk Ultra 32GB)

### **Subsystem 2: STM32 Central Controller**
- **Function:** Manages the user interface, processes the transcribed text, and sends data to the LCD screen.
- **Microcontroller:** STM32F4 Series MCU
- **Interface Components:** Buttons for navigation and text saving
- **Memory Module:** SPI-based NOR Flash (W25Q128JV)

### **Subsystem 3: Display Module**
- **Function:** Displays real-time transcriptions and allows users to scroll through previous text.
- **LCD Screen:** 2.8-inch TFT Display (ILI9341)
- **Controller Interface:** SPI Communication with STM32

### **Subsystem 4: Power Management System**
- **Function:** Provides reliable and portable power for all components.
- **Battery:** 3.7V Li-ion Battery (Adafruit 2500mAh)
- **Power Regulation:** TP4056 Li-ion Charger + 5V Boost Converter
- **Power Optimization:** Sleep mode for STM32 to enhance battery life

---

## **Criterion for Success**
1. The device must accurately transcribe speech to text with reasonable latency.
2. The LCD screen must display real-time transcriptions clearly.
3. The STM32 must successfully manage system operations and communicate with peripheral components.
4. The system should support local storage for saving transcriptions.
5. The battery life should last at least **2-3 hours** under normal usage conditions.

Featured Project

Team Members:

Anita Jung (anitaj2)

Sungmin Jang (sjang27)

Zheng Liu (zliu93)

Link to the idea: https://courses.engr.illinois.edu/ece445/pace/view-topic.asp?id=27710

Problem:

The Donor Wall on the southwest side of first floor in ECEB is to celebrate and appreciate everyone who helped and donated for ECEB.

However, because of poor lighting and color contrast between the copper and the wall behind, donor names are not noticed as much as they should, especially after sunset.

Solution Overview:

Here is the image of the Donor Wall:

http://buildingcampaign.ece.illinois.edu/files/2014/10/touched-up-Donor-wall-by-kurt-bielema.jpg

We are going to design and implement a dynamic and interactive illuminating system for the Donor Wall by installing LEDs on the background. LEDs can be placed behind the names to softly illuminate each name. LEDs can also fill in the transparent gaps in the “circuit board” to allow for interaction and dynamic animation.

And our project’s system would contain 2 basic modes:

Default mode: When there is nobody near the Donor Wall, the names are softly illuminated from the back of each name block.

Moving mode: When sensors detect any stimulation such as a person walking nearby, the LEDs are controlled to animate “current” or “pulses” flowing through the “circuit board” into name boards.

Depending on the progress of our project, we have some additional modes:

Pressing mode: When someone is physically pressing on a name block, detected by pressure sensors, the LEDs are controlled to

animate scattering of outgoing light, just as if a wave or light is emitted from that name block.

Solution Components:

Sensor Subsystem:

IR sensors (PIR modules or IR LEDs with phototransistor) or ultrasonic sensors to detect presence and proximity of people in front of the Donor Wall.

Pressure sensors to detect if someone is pressing on a block.

Lighting Subsystem:

A lot of LEDs is needed to be installed on the PCBs to be our lighting subsystem. These are hidden as much as possible so that people focus on the names instead of the LEDs.

Controlling Subsystem:

The main part of the system is the controlling unit. We plan to use a microprocessor to process the signal from those sensors and send signal to LEDs. And because the system has different modes, switching between them correctly is also important for the project.

Power Subsystem:

AC (Wall outlet; 120V, 60Hz) to DC (acceptable DC voltage and current applicable for our circuit design) power adapter or possible AC-DC converter circuit

Criterion for success:

Whole system should work correctly in each mode and switch between different modes correctly. The names should be highlighted in a comfortable and aesthetically pleasing way. Our project is acceptable for senior design because it contains both hardware and software parts dealing with signal processing, power, control, and circuit design with sensors.

Project Videos

Team 18 Demo Video

Project

Interactive Proximity Donor Wall Illumination

Featured Project

Project Videos