InferX Beta Serverless GPU Inference Platform, Built for Agent-Native Workloads

Endpoint Qwen2.5-Coder-1.5B-Instruct

Lightweight coder

coding low-latency

Metadata

Name
Qwen2.5-Coder-1.5B-Instruct
Provider
Qwen
Parameter Size
1.50B
GPU Count
1
Context Length
2000
Concurrency
146.64x
Cold Start TTFT
Recommended Use Cases
Coding assistant, Code generation
Detailed Intro
testest test

Log In To Use This Endpoint

This public page shows the published endpoint metadata and integration shape. Log in to get a tenant-scoped endpoint URL, inference API key, and the interactive playground. Log in

Integration

Use these values in Dify, OpenWebUI, Continue, OpenCode, or any OpenAI-compatible client that asks for a base URL, API key, and model name.

  1. Copy the API base URL into your client endpoint field.
  2. Copy the model name exactly as shown.
  3. Copy the inference API key.
https://dev1.inferx.net/funccall/<tenant>/endpoints/Qwen2.5-Coder-1.5B-Instruct/v1
Qwen/Qwen2.5-Coder-1.5B-Instruct
<INFERENCE_API_KEY>

An inference API key is required for this endpoint. Until one is available, the sample request below keeps the correct request shape and uses a placeholder token.

Sample REST Call

Model Spec