KoEmo How to Use

From installation to voice input, step by step.

01

Install

Double-click the downloaded file to run
installer file
Follow the setup wizard (default settings are fine)
installer screen
After installation, launch KoEmo
Requirements: Windows 10/11 + NVIDIA GPU (CUDA) required.
02

Launch

KoEmo icon in system tray

> When you launch KoEmo, an icon appears in the system tray at the bottom-right of your screen.

> The settings window opens automatically. If you can see the icon, you're ready to go.

KoEmo settings window (initial state)

Works right away with default settings. No changes needed.

Note: On first launch, KoEmo loads a ~2 GB speech recognition model into memory, which may take around 20 seconds.
03

Voice Input

🎤
Mic
🧠
STT Engine
📝
Text Output
Click and focus the app you want to type into (Notepad, browser, etc.)
Press the "Start" button in KoEmo
Speak into your microphone
When you stop speaking, text is automatically pasted
Settings window while running

After pressing "Start", the button turns into a red "Stop" button

Overlay during speech recognition

Speak and the overlay shows real-time results

04

Direct Input

> Step 03 explained the behavior with Direct Input ON (default). Turn it OFF to review results before confirming.

Direct Input ON (Default)

Auto-pastes when you stop speaking. No keyboard needed.

Speak Auto Paste Done
Direct Input OFF

Review the result before confirming. For accuracy.

Speak Review Confirm
Settings window with Direct Input OFF

Unchecking Direct Input reveals "Confirm Key" and "Confirm Wait" settings

Keyboard Controls (Direct Input OFF)

When OFF, use these keys to control recognition results.

Shift
ConfirmInstantly paste text during recognition
Esc
CancelCancel anytime
During speech recognition

Recognizing: Shift to confirm / Esc to cancel

Waiting for confirmation

Confirm: Shift to paste / Esc to cancel

TIP: Press Esc anytime to cancel. Works in both ON and OFF modes.
05

Settings

> If recognition isn't working well or gets cut off, try adjusting these two settings.

VAD Threshold

Default: 0.60

Sensitivity of Voice Activity Detection.
KoEmo constantly listens to your mic and uses this value to determine if someone is speaking.

0.1 Sensitive
Reacts to quiet voices
May pick up noise
0.9 Less sensitive
Requires clear speech
Resistant to noise
Guide: Quiet room: 0.3–0.5 — Noisy environment: 0.6–0.8

Silence Duration

Default: 0.8s

The length of silence before speech is considered finished.
When silence lasts this long, KoEmo finalizes the recognition result.

0.3s Quick response
Splits on brief pauses
2.0s Relaxed
Allows thinking pauses
Guide: Gets cut off? Raise to 1.0–1.5s. Want faster input? Try around 0.5s
06

Tray Menu

> Even after closing the settings window, KoEmo keeps running in the background.

> Right-click the system tray icon to control KoEmo quickly.

Tray menu (stopped)

Stopped — Click "Start Recognition" to begin

Tray menu (running)

Running — Click "Stop Recognition" to stop

$ troubleshoot

  • Select the correct microphone in "Audio Device" settings
  • Lower the VAD threshold (mic sensitivity) to detect quieter voices
  • Check that your microphone is enabled in Windows Sound Settings
  • Increase the Silence Duration to prevent splitting during pauses
  • Make sure you're not too far from the microphone
  • Click and focus the target app before you start speaking
  • KoEmo pastes into whichever app was focused when speech began
  • Use confirmation mode (Direct Input OFF) to choose from candidates
  • Speak in shorter sentences for better accuracy
  • Using KoEmo in a quiet environment improves recognition