# OnyxKraken AI: AI-Readable Project Brief

Canonical project page: https://markvizion.com/projects/onyxkraken-ai
Live application: https://onyxkraken-face.netlify.app/
Owner: Mark Barnes / MarkVizion
Category: ai
Stack: Python, FastAPI, Ollama, YOLO, whisper.cpp, React

## Summary

A fully autonomous AI desktop entity with vision, voice, memory, knowledge retrieval, and self-improving code generation. It sees your screen, hears you, and acts independently.

## Problem

Most agent demos stop at chat and tool calls. OnyxKraken investigates what happens when an AI system can see, remember, route tools, and act in a desktop environment.

## Architecture

- Local-first agent stack with screen vision, voice input, memory, and FastAPI service boundaries.
- Capability routing connects perception, planning, tool execution, and feedback loops.
- Modular face pack, event bus, and Blender automation layers make behavior visible.

## Evidence

- Live hosted application: https://onyxkraken-face.netlify.app/
- Live hosted public face/app surface connected to the flagship system identity.
- Documented capabilities include YOLO vision, whisper.cpp voice, RAG memory, and 15+ API endpoints.
- Blender automation exercises show action beyond text output.

## Tradeoffs

- Local autonomy requires careful permissions, inspectability, and safety boundaries.
- Desktop agents need environmental awareness before they can be trusted with larger workflows.

## Outcome

The flagship proof system for MarkVizion: autonomous agent architecture, local AI, and creative tool control.

## Authority Signal

Primary evidence for local-first autonomous AI infrastructure and digital embodiment.

## Keywords

AI Agent, Autonomous, Computer Vision, Self-Improving, Python, FastAPI, Ollama, YOLO, whisper.cpp, React, Primary evidence for local-first autonomous AI infrastructure and digital embodiment.
