LogoTop AI Hubs

Shisa AI: Shisa V2 Llama 3.3 70B (free)

Llama3
Text
Free

Shisa V2 Llama 3.3 70B is a bilingual Japanese-English chat model fine-tuned by Shisa.AI on Meta’s Llama-3.3-70B-Instruct base. It prioritizes Japanese language performance while retaining strong English capabilities. The model was optimized entirely through post-training, using a refined mix of supervised fine-tuning (SFT) and DPO datasets including regenerated ShareGPT-style data, translation tasks, roleplaying conversations, and instruction-following prompts. Unlike earlier Shisa releases, this version avoids tokenizer modifications or extended pretraining. Shisa V2 70B achieves leading Japanese task performance across a wide range of custom and public benchmarks, including JA MT Bench, ELYZA 100, and Rakuda. It supports a 128K token context length and integrates smoothly with inference frameworks like vLLM and SGLang. While it inherits safety characteristics from its base model, no additional alignment was applied. The model is intended for high-performance bilingual chat, instruction following, and translation tasks across JA/EN.

Parameters

70B

Context Window

32,768

tokens

Input Price

$0

per 1M tokens

Output Price

$0

per 1M tokens

Capabilities

Model capabilities and supported modalities

Performance

Reasoning

Good reasoning with solid logical foundations

Math

Capable of solving most mathematical problems accurately

Coding

Capable of generating functional code with good practices

Knowledge

Good knowledge foundation across many domains

Modalities

Input Modalities

text

Output Modalities

text

LLM Price Calculator

Calculate the cost of using this model

$0.000000
$0.000000
Input Cost:$0.000000
Output Cost:$0.000000
Total Cost:$0.000000
Estimated usage: 4,500 tokens

Monthly Cost Estimator

Based on different usage levels

Light Usage
$0.0000
~10 requests
Moderate Usage
$0.0000
~100 requests
Heavy Usage
$0.0000
~1000 requests
Enterprise
$0.0000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 2025/05/06