Capabilities

What Thaluna Actually Includes

Thaluna is not only a live OCR window. It combines real-time OCR, beta Audio Translation, Lens Mode, backlog history, transcript saving, built-in local models, Ollama, OpenAI-compatible local APIs, optional cloud models, hardware controls, presets, shortcuts, and manga-focused reading tools inside one Windows app.

Core Workflows in Practice

Thaluna real-time translation workflow
Real-time mode keeps a floating translation window synced with changing text on screen.
Thaluna Lens Mode workflow
Lens Mode handles static overlays, selected regions, preserved layout reading, and the new Continuous Lens workflow for changing scenes.
Thaluna backlog viewer
Backlog history makes long reading sessions easier to follow and revisit.
Thaluna session transcript settings
Session recording can save translation-only or original-plus-translation transcripts to TXT.

Capture and Reading Modes

Real-time OCR mode

Use a movable OCR capture area and a separate translation window for games, videos, subtitles, and live dialogue.

Audio Translation BETA

Use live loopback/system audio transcription in Real-Time mode when OCR is unreliable or subtitles are unavailable.

Lens Mode

Switch to a snapshot-based overlay workflow for full-screen scans, selected areas, and on-image translation placement.

Continuous Lens

Inside Lens Mode, monitor a selected area and refresh overlay translations automatically when the content changes.

Manga Mode

Lens Mode includes manga-focused grouping for speech bubbles and comic-style layouts.

Vertical text support

Snapshot mode includes a vertical-text option for harder Japanese layouts instead of assuming only left-to-right text.

OCR language selection

OCR language is selected separately from the translation target, so you can match recognition to what is actually on screen.

Overlay appearance controls

Lens overlays support font-size changes plus text color, background color, and background opacity customization for readability on complex artwork.

Translation Stack

Thaluna Ollama and OpenRouter settings
Ollama, Custom OpenAI-compatible Local API, and OpenRouter are configured inside the app instead of requiring a separate front-end. The built-in local workflow still works without any external server.

Built-in local models

The default workflow includes local translation, so the app can work offline without forcing a separate cloud account.

Optional Ollama integration

Refresh and use your own Ollama models inside the same app when you want a local model stack beyond the built-in workflow.

Custom OpenAI-compatible Local API

Connect a local OpenAI-compatible server such as llama.cpp, LM Studio, or another compatible endpoint with a base URL, model ID, and optional API key.

Optional OpenRouter cloud mode

Add an API key, choose a model, and use cloud translation when you want more quality or broader model choice.

OpenRouter Speech-to-Text

Audio Translation can use OpenRouter STT models to transcribe live system audio before sending recognized text through the normal translation backend.

Optional Real-Time TTS

Use your existing OpenRouter API key to read translated text or original OCR text aloud with selectable voice and speech speed.

Target-language flexibility

Built-in languages cover the normal workflow, with additional flexibility available through Ollama, Custom Local API, and OpenRouter.

Long-Session Tools

Thaluna backlog viewer
Backlog history is built for reviewing earlier translated lines instead of losing them after each OCR refresh.
Thaluna window presets
Presets restore both the OCR capture region and the translation window layout for different games, players, or reading setups.
Thaluna session transcript settings
Session recording can save translation-only or original-plus-translation transcripts to TXT for later reading and archiving.

Backlog viewer

Review previous translated lines in a separate panel, similar to VN-style dialogue history.

Session transcript saving

Save sessions to TXT and choose between translation-only or original-plus-translation output.

Duplicate filtering

Session recording filters repeated lines so long transcripts stay readable instead of filling with repeated OCR noise.

Window presets

Save and load named layouts for different games, videos, emulators, or reading setups.

Minimal Mode

Hide most of the interface and keep only the translated output visible when you want less UI clutter.

Performance and Control

Thaluna performance options
Performance settings expose OCR mode, optional real-time OCR auto-crop, CPU/GPU selection, low-CPU behavior, auto-pause, and separate OCR versus translation device choices.
Thaluna experimental Windows DPI fix setting
The experimental DPI fix adds an alternate real-time capture path for Windows scaling above 100% without changing the default behavior unless you enable it.
Thaluna Lens settings page
The dedicated Lens settings page exposes Continuous Lens sensitivity, minimum auto-refresh delay, anti-flicker behavior, manual Lens OCR options, Hide After timing, and overlay readability controls.

OCR mode selection

Switch between standard, game/pixel-text, and manga-oriented OCR behavior depending on the content.

Real-time OCR auto-crop

Optionally trim excessive empty margins before OCR when subtitles occupy only a small part of the capture region.

Audio latency presets

Audio Translation includes latency presets, with Stable recommended by default for better transcription quality.

OCR device control

Choose CPU, GPU, or auto for OCR instead of relying on a hidden hardware choice.

Translation device control

Choose CPU, GPU, or auto for local translation models when you want speed or compatibility control.

Low CPU Mode

Reduce refresh pressure when you are playing or reading on weaker hardware.

Auto-Pause

Pause processing automatically when the text stops changing instead of wasting resources on static scenes.

Windows scaling compatibility

An experimental alternate capture path can improve real-time OCR alignment above 100% scaling on supported Windows setups.

Continuous Lens tuning

Adjust sensitivity, minimum auto-refresh delay, and anti-flicker behavior instead of relying on hidden defaults.

Customization and Shortcuts

Theme and color control

Adjust theme, custom text color, background color, and UI color for the real-time translation window.

Snapshot appearance control

Lens Mode has separate text color, background color, and background opacity settings so overlay reading stays legible.

Lens settings page

Manual Lens OCR behavior, Continuous Lens controls, Hide After timing, and overlay readability controls now live in dedicated settings instead of being buried in general performance options.

Global shortcuts

Trigger Minimal Mode, hide icon, toggle mode, quick snapshot, select area, toggle overlay, and load presets through hotkeys.

Text readability controls

Font size and outline thickness are exposed so the translated output remains readable over bright or noisy content.

How this page was built

This page is based on Thaluna's actual current feature set as implemented and documented in the app itself: beta Audio Translation, session transcript recording, snapshot/Lens controls, presets, hardware settings, shortcut handling, Ollama/OpenRouter configuration, and backlog behavior. It is meant to describe the real product surface, not generic marketing copy.

Test the full workflow yourself

The best way to judge Thaluna is to test the exact workflow you care about: live dialogue, snapshots, backlog, transcripts, local models, and hardware settings on your own machine.