Drop-in replacement for the OpenAI API. 6 premium models from Anthropic. Pay once, use until the quota runs out.
| Setting | Value |
|---|---|
| Base URL | https://api.wflab.web.id/v1 |
| API Key | The API Key you received (sk-xxxx-xxxx-xxxx) |
| Auth header | Authorization: Bearer <api-key> |
| Format | OpenAI-compatible (drop-in replacement) |
| Check usage | https://api.wflab.web.id/usage |
Weight = credit multiplier per token. A higher weight burns more credit.
| Model ID | Weight | Best for | Note |
|---|---|---|---|
| kr/claude-sonnet-4.6 | 1× | Light chat, summaries, text editing, default daily use | Cheapest |
| kr/claude-sonnet-4.6-thinking | 2× | Light reasoning, simple refactors | |
| kr/claude-opus-4.6 | 2× | Standard coding, data analysis, review | |
| kr/claude-opus-4.6-thinking | 3× | Complex coding, hard debugging | |
| kr/claude-opus-4.7 | 3× | Best coding (non-thinking), production work | |
| kr/claude-opus-4.7-thinking | 5× | Hard problem solving, deep reasoning, planning | Smartest |
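To make the weight column concrete, here is a minimal Python sketch that estimates credit usage. It assumes credit is simply tokens multiplied by the weight, which is how "multiplier per token" reads; whether input and output tokens are counted the same way is not stated here, so treat the result as a rough estimate. The function name `estimate_credits` is illustrative only.

```python
# Weights copied from the model table.
MODEL_WEIGHTS = {
    "kr/claude-sonnet-4.6": 1,
    "kr/claude-sonnet-4.6-thinking": 2,
    "kr/claude-opus-4.6": 2,
    "kr/claude-opus-4.6-thinking": 3,
    "kr/claude-opus-4.7": 3,
    "kr/claude-opus-4.7-thinking": 5,
}


def estimate_credits(model: str, total_tokens: int) -> int:
    """Estimate credit usage as tokens x weight (assumed formula)."""
    return total_tokens * MODEL_WEIGHTS[model]


if __name__ == "__main__":
    # Compare the cheapest and the most expensive model for the same token count.
    for model in ("kr/claude-sonnet-4.6", "kr/claude-opus-4.7-thinking"):
        print(model, estimate_credits(model, 1_000))
```

Under this assumption, the same 1,000 tokens cost five times more credit on kr/claude-opus-4.7-thinking than on kr/claude-sonnet-4.6.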
Recommended:
- kr/claude-sonnet-4.6 (1×) for all light tasks
- kr/claude-opus-4.6 (2×) when Sonnet is not smart enough
- *-thinking only when long reasoning is needed

Avoid:
- kr/claude-opus-4.7-thinking (5×) for short questions: wasteful

Client settings: Model kr/claude-sonnet-4.6, Base URL https://api.wflab.web.id/v1, API Key sk-xxx

Edit ~/.continue/config.json:
{
  "models": [
    {
      "title": "Sonnet 4.6 (cheap)",
      "provider": "openai",
      "model": "kr/claude-sonnet-4.6",
      "apiBase": "https://api.wflab.web.id/v1",
      "apiKey": "sk-xxxx-xxxx-xxxx"
    },
    {
      "title": "Opus 4.7 Thinking",
      "provider": "openai",
      "model": "kr/claude-opus-4.7-thinking",
      "apiBase": "https://api.wflab.web.id/v1",
      "apiKey": "sk-xxxx-xxxx-xxxx"
    }
  ]
}
Base URL: https://api.wflab.web.id/v1, Key: sk-xxx

Install: pip install openai
from openai import OpenAI

# Point the official OpenAI SDK at the gateway instead of api.openai.com.
client = OpenAI(
    base_url="https://api.wflab.web.id/v1",
    api_key="sk-xxxx-xxxx-xxxx",
)

resp = client.chat.completions.create(
    model="kr/claude-sonnet-4.6",
    messages=[{"role": "user", "content": "Halo"}],
)
print(resp.choices[0].message.content)
Install: npm install openai
import OpenAI from "openai";

// Point the official OpenAI SDK at the gateway instead of api.openai.com.
const client = new OpenAI({
  baseURL: "https://api.wflab.web.id/v1",
  apiKey: "sk-xxxx-xxxx-xxxx",
});

const resp = await client.chat.completions.create({
  model: "kr/claude-opus-4.7",
  messages: [{ role: "user", content: "Tulis fungsi fibonacci" }],
});
console.log(resp.choices[0].message.content);
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    base_url="https://api.wflab.web.id/v1",
    api_key="sk-xxxx-xxxx-xxxx",
    model="kr/claude-opus-4.7-thinking",
)
print(llm.invoke("Jelaskan QED").content)
curl https://api.wflab.web.id/v1/chat/completions \
  -H "Authorization: Bearer sk-xxxx-xxxx-xxxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kr/claude-sonnet-4.6",
    "messages": [{"role":"user","content":"Halo"}]
  }'
Add "stream": true for SSE streaming.
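When streaming is enabled, the body arrives as Server-Sent Events. The sketch below shows one way to pull the text out of such a stream using only the Python standard library. It assumes the gateway follows the usual OpenAI chunk shape (`data: {...}` lines carrying `choices[].delta.content`, closed by `data: [DONE]`); the helper name `iter_content_deltas` and the sample lines are made up for illustration and are not captured from the real service.

```python
import json
from typing import Iterable, Iterator


def iter_content_deltas(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream marker used by OpenAI-style APIs
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


# Hand-written sample stream, not real output from the gateway.
SAMPLE_STREAM = [
    'data: {"choices":[{"delta":{"role":"assistant"}}]}',
    "",
    'data: {"choices":[{"delta":{"content":"Ha"}}]}',
    'data: {"choices":[{"delta":{"content":"lo"}}]}',
    "data: [DONE]",
]

if __name__ == "__main__":
    print("".join(iter_content_deltas(SAMPLE_STREAM)))
```

With the openai SDK, passing `stream=True` to `client.chat.completions.create` does this parsing for you; the manual version is mainly useful with raw HTTP clients.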
Hermes is an agent that connects AI to WhatsApp and Telegram channels. To use our API key as the LLM backend for Hermes, fill in the LLM Provider configuration in Hermes:
| Setting | Value |
|---|---|
| Provider type | openai / OpenAI Compatible |
| Base URL | https://api.wflab.web.id/v1 |
| API Key | sk-xxxx-xxxx-xxxx (your key) |
| Model (default) | kr/claude-sonnet-4.6 (1×, saves credit) |
| Model (premium) | kr/claude-opus-4.7-thinking (5×, for reasoning) |
| Streaming | enable / stream: true (Hermes usually streams so replies appear quickly in chat) |
| Max tokens | 4096–8192 (adjust to the reply length you need) |
| Timeout | ≥ 120 seconds (thinking models take longer) |
llm:
  provider: openai
  base_url: https://api.wflab.web.id/v1
  api_key: ${HERMES_LLM_API_KEY} # set in the environment
  model: kr/claude-sonnet-4.6
  fallback_model: kr/claude-opus-4.6
  stream: true
  max_tokens: 4096
  timeout: 180
https://api.wflab.web.id/v1/chat/completions fully supports SSE.

On too_many_active_locations: if Hermes is moved to another server, the old IP has a 5-minute TTL and then rotates out automatically.

Web (easiest): open https://api.wflab.web.id/usage → paste your API Key → click Check.
What is shown: plan, status, used / quota credit, request count, total tokens, a per-model breakdown, and the last 30 requests.
curl https://api.wflab.web.id/api/usage \
-H "Authorization: Bearer sk-xxxx-xxxx-xxxx"
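The same usage check can be scripted. Below is a small Python sketch using only the standard library; it mirrors the curl call (same URL, same Bearer header). The response is assumed to be a JSON object, but its field names are not documented here, so the sketch returns the parsed body without interpreting it. `build_usage_request` and `fetch_usage` are illustrative names.

```python
import json
import urllib.request

USAGE_URL = "https://api.wflab.web.id/api/usage"


def build_usage_request(api_key: str) -> urllib.request.Request:
    """Build the GET request for the usage endpoint with a Bearer token."""
    return urllib.request.Request(
        USAGE_URL,
        headers={"Authorization": f"Bearer {api_key}"},
    )


def fetch_usage(api_key: str, timeout: float = 30.0) -> dict:
    """Call the usage endpoint and return the parsed JSON body (assumed JSON)."""
    with urllib.request.urlopen(build_usage_request(api_key), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Replace the placeholder with your own key before running.
    print(json.dumps(fetch_usage("sk-xxxx-xxxx-xxxx"), indent=2))
```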
Error responses are always OpenAI-shaped JSON with an error.code field.
| HTTP | Code | Meaning | Action |
|---|---|---|---|
| 401 | invalid_api_key | API Key is wrong / has a typo | Check that the full key was pasted |
| 401 | no_plan | Key has no plan | Contact the seller |
| 403 | quota_exhausted | Credit is used up | Buy a new package |
| 403 | manual_disabled | Disabled by the seller | Contact the seller |
| 403 | expired | Plan expired (if an expiry is set) | Buy a new package |
| 429 | rate_limit_exceeded | Per-minute rate exceeded | Wait as indicated by Retry-After |
| 429 | too_many_concurrent_requests | Too many parallel requests | Reduce parallelism to match your package |
| 429 | too_many_active_locations | Too many different IPs (5 min TTL) | Use an IP that is already active |
| 502 | bad_gateway | Upstream temporarily down | Retry after a few seconds |
| 504 | upstream_timeout | Upstream is slow | Retry; report it if it happens often |
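The "Action" column can be turned into a simple client-side retry policy. The sketch below treats only the codes whose action is "wait" or "retry" as retryable and leaves everything else (bad key, no quota, too many parallel requests or IPs) to be fixed by hand. The backoff base of 2 seconds and the 60-second cap are arbitrary assumptions, not values published by the service, and `retry_delay` is a hypothetical helper name.

```python
from typing import Optional

# Codes whose documented action is to wait or retry.
RETRYABLE_CODES = {"rate_limit_exceeded", "bad_gateway", "upstream_timeout"}


def retry_delay(
    code: str,
    attempt: int,
    retry_after: Optional[float] = None,
    base: float = 2.0,
    cap: float = 60.0,
) -> Optional[float]:
    """Return seconds to wait before retrying, or None if a retry will not help.

    `attempt` is 0 for the first retry. `retry_after` is the Retry-After
    header value in seconds, if the response carried one.
    """
    if code not in RETRYABLE_CODES:
        return None
    if code == "rate_limit_exceeded" and retry_after is not None:
        return retry_after  # honor the server's hint
    return min(cap, base * (2 ** attempt))  # capped exponential backoff


if __name__ == "__main__":
    for code in ("invalid_api_key", "rate_limit_exceeded", "bad_gateway"):
        print(code, retry_delay(code, attempt=0, retry_after=7.0))
```

A caller would read `error.code` from the JSON body, pass it in together with the attempt number, and stop retrying as soon as the function returns None.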
Base URL: https://api.wflab.web.id/v1

Use kr/claude-sonnet-4.6 for the first test (cheapest).

If it fails, the error message is self-explanatory. It is usually just a typo in the Base URL (don't forget the trailing /v1) or an incomplete API Key.
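Since the two most common mistakes are a Base URL without /v1 and a badly pasted key, a quick local check before the first request can save a round trip. This is only a sketch: the `sk-` prefix rule is inferred from the placeholder key format shown in this guide, and `check_config` is a hypothetical helper, not part of any SDK.

```python
from typing import List


def check_config(base_url: str, api_key: str) -> List[str]:
    """Return a list of likely configuration problems (empty if none found)."""
    problems = []
    if not base_url.rstrip("/").endswith("/v1"):
        problems.append("Base URL should end with /v1")
    if any(ch.isspace() for ch in api_key):
        problems.append("API Key contains whitespace")
    if not api_key.strip().startswith("sk-"):
        # Assumption: keys look like the sk-xxxx-xxxx-xxxx placeholder.
        problems.append("API Key should start with sk-")
    return problems


if __name__ == "__main__":
    print(check_config("https://api.wflab.web.id", "sk-xxxx-xxxx-xxxx"))
```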