Sakana AI – What It Is, How It Works, and Why It Could Change the Future of AI

MarGib June 25, 2026
🌐 🇵🇱 Polski · 🇬🇧 EN

Sakana AI is one of the fastest-growing AI companies in the world, having become a Japanese unicorn valued at over $2.6 billion in just a few years (Startup Intros – Sakana AI Funding). But this isn’t just another startup churning out another language model. Sakana AI focuses on collective intelligence, evolution, and localized adaptation—what it calls “AI that creates AI.”

In this article, we’ll explore:

  • what Sakana AI is and when it was founded,
  • how its core technologies—Namazu, Fugu, and Marlin—work,
  • what problems they solve (e.g., refusal to answer, export restrictions, long-term research),
  • who’s behind it and why it could matter for the future of AI.


1. What Is Sakana AI and When Was It Founded?

Sakana AI is a Tokyo-based research and development lab focused on Frontier AI—technology that meets the highest global standards. The company was founded in July 2023 by three experienced researchers and managers:

  • David Ha – CEO, former Head of Research at Google Brain and Head of Research at Stability AI.
  • Llion Jones – CTO, co-author of the groundbreaking paper “Attention Is All You Need”, which introduced the Transformer architecture.
  • Ren Ito – Chairman, former COO of Stability AI with experience scaling AI startups.

Its official mission is: “Building Frontier AI in Japan”—developing world-class AI tailored to Japan’s needs while competing with global giants (Sakana AI Corporate Info).

The company operates across three main pillars:

  1. Research – nature-inspired research (evolution, collective intelligence),
  2. Applied – solutions for finance, defense, and infrastructure,
  3. Product – tools such as Sakana Chat, Sakana Marlin, and Sakana Fugu.

Sakana AI quickly secured funding from top-tier VC firms (including Khosla Ventures, Lux Capital, and Nvidia) and major Japanese megabanks (MUFG, SMBC, Mizuho), becoming one of Japan’s fastest-growing unicorns (Startup Intros – Sakana AI Funding).


2. Namazu – How to Turn a Global Model Into a “Japanese” One Without Losing Power

2.1. The Problem: Global Models Don’t Fit Japan

Most large language models (LLMs) come from the U.S. or China. They’re trained on global data, but:

  • they may carry strong ideological biases (e.g., political, historical),
  • they often refuse to answer on sensitive topics (self-censorship),
  • they’re not optimized for the Japanese language or local cultural context.

According to Sakana AI’s research, some foreign models refuse to answer questions about politics, history, or diplomacy in 72% of cases (BigGo Finance – Namazu post-training). This makes them practically useless in business or administration, where objective, fact-based responses are required.

2.2. The Solution: Namazu Post-Training

Sakana AI developed a post-training technique that:

  • takes existing large open-source models (e.g., DeepSeek-V3.1-Terminus, Llama-3.1-405B),
  • and adapts them to Japan using specialized training on data that reflects Japanese cultural and security contexts.

The result is a series of prototype models called Namazu (alpha version) that:

  • maintain high performance on standard benchmarks (AIME’25, MMLU-Redux, GPQA Diamond, LiveCodeBench, IFEval),
  • while radically improving neutrality and accuracy on political and historical topics (Sakana AI – Namazu Alpha).

Key outcome: the refusal rate for sensitive questions dropped from 72% to nearly 0% in the Namazu-DeepSeek-V3.1-Terminus model (BigGo Finance – Namazu post-training).

2.3. How Does It Work Technically?

Namazu isn’t a model built from scratch. It’s an adaptation of existing models:

  1. A high-quality open-source model is selected (e.g., DeepSeek-V3.1-Terminus).
  2. Sakana AI builds a specialized dataset that reflects Japanese cultural, political, and security contexts.
  3. The model is fine-tuned (post-trained) to:
    • better understand Japanese,
    • be more neutral in responses,
    • stop refusing to answer sensitive topics without justification.

Benchmark results show that Namazu performs comparably to base models on math, logic, and coding tasks, but significantly outperforms them on politics and history (Sakana AI – Namazu Alpha).

2.4. Sakana Chat – A Free Namazu-Powered Chatbot

Built on Namazu is Sakana Chat—a free chatbot primarily for users in Japan that:

  • uses Namazu models as its engine,
  • includes web search capabilities,
  • allows real-time comparison and integration of information from multiple sources (GIGAZINE – Sakana Chat & Namazu).

Sakana Chat was tested by around 1,000 beta users, and their feedback helped refine both the model and the interface (Sakana AI – Namazu Alpha).


3. Sakana Fugu – A Multi-Agent System as One Model

3.1. Why One Model Isn’t Enough

Most AI companies offer single models: GPT-5.5, Claude Opus, Gemini, etc. But every model has strengths and weaknesses. Sakana AI took a different path: instead of building one giant model, it created a multi-agent system that behaves like a single model.

Sakana Fugu is:

  • a multi-agent system (MAS),
  • that dynamically coordinates a pool of different LLM models,
  • accessible via a single OpenAI-compatible API (Sakana AI – Fugu).

From the user’s perspective, it works like this:

  • you send a query to one endpoint,
  • you specify a model fugu or fugu‑ultra,
  • and Fugu internally decides which models to use and how to coordinate them.

3.2. Architecture: TRINITY and Conductor

Fugu’s architecture is based on two ICLR 2026 papers:

  • TRINITY – a lightweight coordinator that assigns roles to agents:
    • Thinker – plans,
    • Worker – executes tasks,
    • Verifier – validates results.
  • Conductor – a model trained with RL to discover coordination strategies in natural language (Sakana AI – Fugu).

In practice, this means Fugu can:

  • solve simple tasks on its own,
  • or assemble a team of experts (different models) and coordinate their work,
  • while the user sees only one consolidated response.

3.3. Fugu vs. Fugu Ultra

Sakana Fugu offers two variants:

  1. Fugu – a balance between performance and latency, designed for everyday use. It allows excluding specific agents from the pool (e.g., if you don’t want to use a particular provider).
  2. Fugu Ultra – optimized for maximum response quality on complex tasks (e.g., Kaggle competitions, cybersecurity analysis), using a deeper pool of experts (fixed pool, no exclusion option) (Sakana AI – Fugu).

3.4. Benchmarks: Fugu Ultra vs. GPT-5.5, Opus, and Gemini

According to benchmarks shared by Sakana AI, Fugu Ultra achieves comparable or better results than leading models:

  • SWE Bench Pro (software engineering problem-solving):
  • TerminalBench 2.1 (agentic coding): Fugu Ultra: 82.1.
  • LiveCodeBench Pro: Fugu Ultra: 90.8.
  • GPQA-D (scientific knowledge): Fugu Ultra: 95.5.

This means Fugu doesn’t just combine multiple models—it outperforms them as a system.

3.5. How to Use the Fugu API?

Fugu is available via an OpenAI-compatible API:

Example in Python:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.sakana.ai",
    api_key="sk-...",  # Twój klucz z console.sakana.ai
)

response = client.chat.completions.create(
    model="fugu",  # lub "fugu-ultra"
    messages=[
        {"role": "user", "content": "Wyjaśnij, czym jest Sakana Fugu."}
    ],
)

print(response.choices[0].message.content)

This makes it easy to replace OpenAI GPT with Fugu in existing applications.


4. Sakana Marlin – A Virtual CSO for Ultra-Deep Research

4.1. What Is Marlin?

Sakana Marlin is Sakana AI’s first commercial product that isn’t a language model—it’s an autonomous research agent. Described as “Your Virtual CSO (Chief Strategy Officer)”, it’s a tool for ultra-deep strategic research (Sakana AI – Marlin).

While most AI tools (e.g., ChatGPT Deep Research, Gemini Deep Research) focus on speed (responses in seconds), Marlin deliberately slows down the process:

  • it operates for up to 8 hours of continuous, autonomous reasoning,
  • generates reports ranging from dozens to ~100 pages plus slides for executives,
  • is designed for corporations, financial institutions, think tanks, and consultants (VentureBeat – Sakana Marlin).

4.2. How Does It Work Technically? AB-MCTS and Multi-Model Collaboration

Marlin is built on the AB-MCTS (Adaptive Branching Monte Carlo Tree Search) architecture—a method that enables AI to perform trial-and-error and explore multiple hypotheses simultaneously (Sakana AI – AB-MCTS).

Combined with multiple LLM models (e.g., OpenAI o4-mini, Gemini 2.5 Pro, DeepSeek R1-0528), Marlin:

  • formulates hypotheses,
  • scours the web,
  • resolves contradictions between sources,
  • and delivers exhaustive, expert-level strategic reports.

It’s a product directly rooted in Sakana AI’s earlier research, such as The AI Scientist (a system that automatically generates academic papers) (Sakana AI – The AI Scientist).

4.3. Business Applications

Marlin is designed for tasks like:

  • market and competitor analysis,
  • assessing technology trends,
  • regulatory scenarios (e.g., stablecoins, payment tokenization),
  • AI agent market maps for enterprises (Sakana AI – Marlin).

Example topics Marlin can research:

  • “Scenarios for a blockade of the Strait of Hormuz and their impact on the global economy”,
  • “Impact of stablecoin regulations on payment systems”,
  • “AI agent market map for large corporations”.

Marlin doesn’t replace human decision-making, but it dramatically reduces the time needed for research, allowing teams to focus on strategic choices.


5. Who’s Behind Sakana AI and Why It Matters

5.1. The Founders and Their Expertise

  • David Ha – CEO, known for research in generative models and evolutionary algorithms. Previously led teams at Google Brain and Stability AI.
  • Llion Jones – CTO, co-author of “Attention Is All You Need,” the foundational paper behind modern LLMs.
  • Ren Ito – Chairman, with experience scaling AI startups and global operations.

Their blend of academic and business experience makes Sakana AI both a research lab and a product company.

5.2. Mission: AI for Japan, with Global Reach

Sakana AI isn’t aiming to be just a local provider. Its goal is to:

  • build sovereign AI for Japan—solutions not entirely dependent on foreign suppliers,
  • while competing with global companies on the international stage.

In an interview with Science Japan, Ren Ito said the key is thinking not in terms of “foreign vs. Japan,” but “U.S. West Coast vs. the rest of the world”—and building a world-class company whose technology happens to be based in Japan (Science Japan – Interview with Ren Ito).

5.3. Why Sakana AI Could Change the Future of AI

Sakana AI stands out in several key ways:

  1. Post-training over giant pretraining – instead of spending hundreds of millions on training from scratch, it adapts existing models to local needs. This is more scalable and cost-effective.
  2. Multi-agent systems as a product – Fugu shows that the future of AI may not lie in bigger models, but in better coordination of many models.
  3. Long-term reasoning – Marlin proves AI can operate not just in seconds, but in hours, opening the door to more complex tasks.
  4. Focus on Japan – while most AI companies target the U.S./EU, Sakana AI shows that regional needs can drive innovation.

6. Summary: What Sakana AI Means for Users and Developers

For end users:

  • Sakana Chat offers a free, Japan-optimized chatbot with web search.
  • Sakana Marlin is a tool for deep strategic research for businesses.
  • Sakana Fugu is an API that can replace traditional models in applications, often delivering better quality thanks to multi-agent coordination.

For developers:

  • Fugu provides an OpenAI-compatible API that can be easily integrated into existing tools.
  • Namazu demonstrates how to adapt global models to local needs without sacrificing performance.
  • The multi-agent architecture of Fugu and Marlin’s long-term reasoning could inspire new approaches to AI system design.

Sakana AI is more than just another AI startup. It’s an example of how collective intelligence, evolution, and localized adaptation can change the way we build and use artificial intelligence. If you want to follow their progress, check out:

Facebook X E-mail

Comments

Dodaj komentarz

Explore

Labels

artificial intelligence 11 news 11 Windows 10 browsers 10 Opera 9 Security 9 facebook 8 web applications 8 Automation 7 Software 7 Technology 7 chrome 7 coaching 7 curiosities 7 www 7 Docker 6 Microsoft 6 Mind 6 Web browser 6 entertainment 6 new technologies 6 technology 6 Cybersecurity 5 God 5 Productivity 5 Programming 5 Red Hat 5 automation 5 books 5 Anthropic 4 CentOS 4 LLM 4 Open Source 4 RedHat 4 Vivaldi 4 Windows 10 4 Windows system administration 4 applications 4 containers 4 education 4 health 4 people 4 photography 4 trivia 4 Android 3 BIG DATA 3 Business 3 Claude 3 Claude AI 3 FAQ 3 FIFA 3 Firefox 3 Google projects 3 Local AI 3 OpenAI 3 Personal Development 3 Privacy 3 Programs 3 Ubuntu 3 algorithms 3 bash 3 communication 3 computer science 3 cybersecurity 3 extensions 3 faith 3 games 3 good movie 3 help 3 human 3 interesting websites 3 interface 3 media 3 money 3 n8n 3 network 3 opensource 3 personal competencies 3 personal development 3 programming 3 psychology 3 reading 3 religion 3 security 3 system administration 3 tools 3 virtualization 3 web browser 3 websites 3 AI assistant 2 Administration 2 Asus 2 Career 2 Centos 2 Cloud 2 Configuration 2 Debian 2 DevOps 2 Docker Machine 2 Drones 2 Education 2 Free Red Hat 2 Hardware 2 Intel 2 Intelligence 2 Japan 2 Job Market 2 Machine Learning 2 Netflix 2 Performance 2 Personal Finance 2 Psychology 2 RHEL7 2 RSS 2 Rocky Linux 2 Sakana AI 2 Servers 2 Software Engineering 2 Windows administration 2 Windows errors 2 ansible 2 better life 2 brain 2 chat 2 children 2 cloud storage 2 communicator 2 communities 2 computer intelligence 2 computers 2 conferences 2 creativity 2 curl 2 cyberattacks 2 data 2 death 2 documentary 2 earning 2 emotions 2 file storage 2 fix 2 free application 2 free courses 2 free knowledge from the internet 2 free training 2 future of work 2 genius 2 hacker 2 investments 2 knowledge 2 learning 2 local AI 2 machine learning 2 mind manipulation 2 mind programming 2 mindfulness 2 mobile 2 mobile apps 2 mobile phones 2 motivation 2 movie 2 multimedia 2 open-source 2 personal thoughts 2 photos 2 plugin 2 podcast 2 privacy 2 prompt 2 shell 2 software 2 terminal 2 torrent 2 trick 2 wealth 2 weather 2 web 2 wisdom 2 youtube 2 (Treści etykiet nie zostały podane w treści wejściowej) 1 120B models 1 21st Century Skills 1 2FA 1 64 bit 1 7 1 ACT therapy 1 AGI 1 AI Agents 1 AI Frameworks 1 AI History 1 AI Safety 1 AI agents 1 AI benchmarks 1 AI censorship 1 AI ethics 1 AI future 1 AI governance 1 AI in healthcare 1 AI in sports 1 AI superchips 1 AIMP 1 AMD ROCm 1 Acquisition 1 Alan Watts 1 Alexander Gerst 1 AlmaLinux 1 Alpine Linux 1 Andrej Karpathy 1 Anonymous 1 Apache 1 Apple 1 Apple Silicon 1 Aria AI 1 Audacity 4 1 AutoGen 1 Banking 1 Bash 1 Bill Warner 1 Biotechnology 1 Black Mirror 1 Blackwell B100 1 Blockchain 1 Bonding 1 Bono 1 Business and Finance 1 C++ 1 CPU 1 CUA 1 CUDA 1 Career Development 1 Chat GPT 1 ChatGPT 1 Chemtrails 1 ChildOnlineSafety 1 Claude Fable 1 Coaching 1 Codex 1 Computer-Using Agent 1 Constitutional AI 1 Copilot 1 Copilot for Finance 1 Couching 1 CrewAI 1 Cryptocurrencies 1 Cyberbullying 1 Dario Amodei 1 Darwin 1 Data Science 1 Debugging 1 Deep Learning 1 DeepSeek 1 Deepseek 1 Deluge 1 Devin AI 1 Diagnostics 1 Digitalization 1 Docker containers 1 Drivers 1 Dystrybucje 1 EA GAMES 1 EA SPORTS 1 Economics 1 Email 1 Emigration 1 Enterprise Linux 1 Entrepreneurship 1 Error 1 Excel 1 FIFA 16 1 Fable 1 Fact-checking 1 Fake News 1 Flannel 1 Flynn Effect 1 Football 1 Foundation 1 Free 1 Free Software 1 Free software 1 Fugu Ultra 1 Future 1 Future of Finance 1 Future of Work 1 GDPR 1 GLM-5.2 1 GPT 1 GPT-4 1 GPT-4.5 1 GPU Cloud 1 GUI 1 Gemini 1 Generation Z 1 GitHub 1 Golden Gate 1 Google Assistant 1 Google Gemma 4 12B 1 Google activity 1 GoogleFamilyLink 1 Got Talent 1 Gregory Kurtzer 1 Guide 1 Guides 1 HTML 1 Hardware Requirements 1 Homelab 1 Hygge 1 IAM 1 IBM 1 IDE 1 IQ 1 ISIS 1 ISS 1 IT 1 IT history 1 Intelligent email 1 Internet Browser 1 Internet browser 1 InternetEducation 1 Interview 1 Islam 1 Islamic State 1 Jacquard 1 JavaScript 1 Jboss 1 Jetson Thor price 1 Joel Pearson 1 Kali Linux 1 Kernel 1 Khan Academy 1 Kylian Mbappé 1 LLM Deployment 1 Labor Market 1 Legal regulations 1 LibreOffice 1 Linux diagnostics 1 Londoners 1 MFA 1 MLX 1 Maps 1 MarGib_Film 1 Marek Jankowski 1 Mars helicopter 1 Material Design 1 Matt Pocock 1 Medicine 1 Microsoft 365 1 Military 1 Mindfulness 1 Miłosz Brzeziński 1 MrBallen 1 My take 1 Mythos 1 NTFS 1 NVIDIA 1 NVIDIA Blackwell 1 NVIDIA Jetson Thor 1 National security 1 Navy SEALs 1 Neural Networks 1 New 1 Nginx 1 No comment 1 Node.js 1 Non-profit 1 Notion 1 Nvidia 1 Odysseus 1 Opera Air 1 Opera Neon 1 Opera Touch 1 P2P 1 Pac-Man 1 Pekao S.A 1 Peperclips 1 Perceptron 1 Personal development 1 Philosophy 1 Photoshop 1 Poland 1 Poles 1 PowerShell 1 Project TANGO 1 Proton Drive 1 Puppeteer 1 PyTorch 1 Qt Creator 1 Quotes 1 RHEL8 1 Raspberry PI 1 Raspbian 1 Red Hat 8 1 Red Hat Enterprise Linux Developer Suite 1 RedHat 8 1 Regex 1 Robo-advisors 1 Rust 1 SUSE 1 SafeInternet 1 SaferInternetDay 1 Safety 1 Sakana Fugu 1 Search 1 Security Auditing 1 Self-hosting 1 September 23 2017 1 Server Administration 1 Smart City 1 Snip. 1 Social Media 1 Soli 1 Solo Projects 1 Solopreneurship 1 Something from myself 1 Sound 1 Sovereign AI 1 Sport 1 Steam Deck 1 SysAdmin 1 System Administration 1 Tech 1 TensorFlow 1 The Shack 1 Time Management 1 Tips 1 Tokenomics 1 Tools 1 Tribler 1 Tutorial 1 U2 1 USB 1 Ubuntu 26.04 1 Ubuntu Server 1 VentuSky 1 WBC 1 WSL 3 1 WWDC 2026 1 WWDC26 1 Warsaw 1 Weave 1 Web Scraping 1 Websites 1 Windows update 1 Work 1 Workflow 1 World Cup 1 World Cup 2026 1 World Wide Web 1 X-Files 1 X-files 1 YouTube 1 ZUS 1 ZenFone 1 a drop of motivation 1 about this blog 1 account security 1 achieving goals 1 ad blocking 1 addiction 1 administrator 1 aids 1 animations 1 assertiveness 1 audio 1 audio editing 1 automateit 1 autonomous cars 1 awareness 1 bank 1 bash on windows 1 bat files 1 batch 1 battery 1 beliefs 1 beta 1 better living 1 better quality 1 bin/bash 1 blocking 1 blogger 1 body language 1 bookmarks 1 boot 1 bootable usb 1 boxing 1 brain-computer interfaces 1 business intelligence 1 c# 1 calc 1 campaign 1 cards 1 centralized platforms 1 chemistry 1 clearance 1 clothing industry 1 cmd 1 code editor 1 cognitive psychology 1 coldplay 1 command history 1 command line 1 command prompt 1 comments 1 computer interaction 1 concentration 1 configuration management 1 conntrack 1 console 1 conspiracy 1 conspiracy theories 1 controversial 1 converter 1 corporate world 1 courses 1 courses for free 1 dark mode 1 data security 1 date and time 1 deep learning 1 developer tools 1 digital clothing 1 disqus 1 document 1 dreams 1 drop of motivation 1 dubai 1 dying 1 e-book 1 eBPF 1 economy 1 end of the world 1 end of world 1 energy 1 energy efficiency 1 environment and health 1 ethical AI 1 evolution 1 excel 1 exploitation 1 extreme 1 file sharing 1 file size 1 film zone 1 flash drive 1 flat earth 1 flying 1 food 1 football 1 for sale 1 format change 1 free 1 free software 1 friend location 1 future of humanity 1 future of transport 1 future skills 1 game 1 geoengineering 1 google chat 1 graphics 1 graphics editors 1 growing up 1 hacking 1 happiness 1 hard-link 1 hashing 1 hedonic adaptation 1 helion 1 history 1 hobby 1 home hosting 1 hostname 1 hostnamectl 1 how many people live on earth 1 humanity 1 humor 1 iOS 1 iftop 1 immortality 1 influencer criticism 1 infrastructure 1 innovation 1 installation 1 intelligence 1 internet applications 1 investing 1 javascript 1 kuba wojewódzki 1 labor market 1 language models 1 light 1 login 1 macOS 1 magic 1 make life harder 1 making money 1 material design 1 meditation 1 memory 1 messenger 1 meteorology 1 mobile applications 1 mobile photography 1 mounting 1 mp3 player 1 music 1 music player 1 mysteries 1 net use 1 nethogs 1 network monitoring 1 network resources 1 network security 1 networking 1 neurobiology 1 neuropsychology 1 neurotechnology 1 new life 1 new player 1 new things 1 nftables 1 office 1 onboarding 1 onestep4red 1 online 1 online courses 1 open source 1 operating systems 1 outage 1 paper clips 1 paradox of the fulfilled dream 1 parenting 1 parents 1 password 1 password change 1 password policy 1 password recovery 1 password security 1 pdf 1 penetration testing 1 performance 1 personal data 1 philosophy 1 phishing 1 php 1 plague 1 player 1 poison 1 police 1 predictions 1 promissory notes 1 protection 1 questions 1 radar 1 red 1 relax 1 relaxation 1 remote work 1 reportage 1 rest 1 robotaxi 1 root 1 science 1 scientific facts 1 screen 1 screenshot 1 series 1 show 1 skydive 1 sleep 1 small big company 1 smart clothing 1 smartphone 1 social engineering 1 social media 1 society 1 space 1 sport 1 sports 1 spreadsheet 1 stalking 1 statistics 1 streaming 1 sub-millimeter sensor 1 success 1 symbolic link 1 syngrapha 1 system acceleration 1 tablet 1 talk show 1 technological innovations 1 television 1 terrorism 1 testing 1 the world in numbers 1 threats 1 time management 1 time travel 1 timelapse 1 tips 1 two-factor authentication 1 ubuntu 1 upbringing 1 users 1 viral 1 virtualbox 1 walking 1 walking meetings 1 weather forecasting 1 webmaster 1 windows automation 1 word processing 1 work 1 work automation 1 world 1 world cup 2026 1 world wide web 1 you are a miracle 1 zeitgeist 1

Blog archive

Table of contents