A 1.2B parameter Liquid Neural Network model specialized for converting natural language queries into structured JSON function calls. Runs efficiently on low-resource hardware while achieving 97% syntax reliability.
Use the LFM2.5-1.2B-Nova-Function-Calling model to convert natural language requests into structured JSON function calls for tools, APIs, and software agents.
This skill leverages a specialized 1.2B-parameter Liquid Neural Network fine-tuned for robust function calling. Despite its compact size, it achieves 97% syntax reliability, comparable to GPT-4o-class performance, making it well suited to resource-constrained environments while maintaining high-quality structured output.
The model excels at translating natural language requests into well-formed `<tool_call>` JSON for tools, APIs, and software agents, even on CPU-only or low-VRAM hardware.
When the user requests function calling or tool invocation capabilities:
1. **Install Dependencies**
- Ensure `transformers`, `torch`, and `unsloth` libraries are installed
- Use `pip install unsloth transformers torch accelerate` if not present
- Verify CUDA availability for GPU acceleration (optional but recommended)
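A quick sanity check before loading the model; this minimal sketch only confirms the core libraries import and reports whether a CUDA device is visible:
```python
import torch
import transformers

# Confirm library versions and GPU visibility before loading the model.
print(f"transformers {transformers.__version__}, torch {torch.__version__}")
print("CUDA available:", torch.cuda.is_available())
```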
2. **Load the Model**
- Import required modules: `FastLanguageModel` from `unsloth`, `torch`
- Load model using: `NovachronoAI/LFM2.5-1.2B-Nova-Function-Calling-Full`
- Set max sequence length to 4096 tokens
- Enable 4-bit quantization for memory efficiency: `load_in_4bit=True`
- Call `FastLanguageModel.for_inference(model)` to prepare for generation
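A minimal loading sketch using Unsloth's standard `FastLanguageModel.from_pretrained` interface; adjust the keyword arguments to your environment:
```python
from unsloth import FastLanguageModel

# Load the fine-tuned model with 4-bit quantization for memory efficiency.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="NovachronoAI/LFM2.5-1.2B-Nova-Function-Calling-Full",
    max_seq_length=4096,
    load_in_4bit=True,
)

# Switch Unsloth into its optimized inference mode before generating.
FastLanguageModel.for_inference(model)
```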
3. **Format Input Prompts**
- Use ChatML format with `<|im_start|>` and `<|im_end|>` tags
- Structure: `<|im_start|>user\n{query}\n<|im_end|>\n<|im_start|>assistant`
- Include tool definitions in the system message when available
- Keep user queries clear and specific about desired actions
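A sketch of the prompt construction described above. The tool schema and system-message wording are illustrative assumptions, not part of the model's documented template:
```python
import json

# Hypothetical tool schema included in the system message.
tools = [{
    "name": "calculate_circle_area",
    "description": "Compute the area of a circle from its radius.",
    "parameters": {"radius": {"type": "number"}},
}]

query = "I need to calculate the area of a circle with a radius of 5."

# ChatML-style prompt following the structure described above.
prompt = (
    "<|im_start|>system\n"
    f"You can call these tools:\n{json.dumps(tools)}\n<|im_end|>\n"
    "<|im_start|>user\n"
    f"{query}\n<|im_end|>\n"
    "<|im_start|>assistant"
)
```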
4. **Generate Function Calls**
- Tokenize the prompt with `return_tensors="pt"`
- Move inputs to appropriate device (CUDA/CPU)
- Generate with `max_new_tokens=128` (adjust for complex calls)
- Enable `use_cache=True` for faster inference
- Parse output between `<tool_call>` and `</tool_call>` tags
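Continuing from the loading and prompt sketches above, generation might look like this; the token count and decoding details are reasonable defaults, not fixed requirements:
```python
# Tokenize and move inputs to the same device as the model (GPU if available).
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Generate the function call; raise max_new_tokens for complex, multi-argument calls.
outputs = model.generate(**inputs, max_new_tokens=128, use_cache=True)

# Decode only the newly generated tokens, skipping the prompt.
response = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(response)
```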
5. **Parse and Validate Output**
- Extract JSON from `<tool_call>` tags in assistant response
- Validate JSON structure contains `name` and `arguments` fields
- Check that function name matches available tools
- Verify argument types match expected schema
- Handle malformed outputs gracefully with error messages
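A minimal parsing and validation helper, assuming the output follows the `<tool_call>` tag convention shown above; schema-level type checking of the arguments is left out for brevity:
```python
import json
import re

def parse_tool_call(text, available_tools):
    """Extract and validate the JSON payload between <tool_call> tags."""
    match = re.search(r"<tool_call>(.*?)</tool_call>", text, re.DOTALL)
    if not match:
        return None, "No <tool_call> block found in the model output."
    try:
        call = json.loads(match.group(1).strip())
    except json.JSONDecodeError as exc:
        return None, f"Malformed JSON in tool call: {exc}"
    if "name" not in call or "arguments" not in call:
        return None, "Tool call is missing the 'name' or 'arguments' field."
    if call["name"] not in available_tools:
        return None, f"Unknown tool: {call['name']}"
    return call, None
```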
6. **Execute Tool Calls**
- Map function names to actual tool implementations
- Unpack arguments from JSON into function parameters
- Execute the tool with provided arguments
- Capture and format tool results
- Return results to continue conversation flow
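A sketch of dispatching the parsed call to a real implementation. The `TOOL_REGISTRY` mapping and `calculate_circle_area` function are hypothetical examples, and the snippet reuses `parse_tool_call` and `response` from the earlier sketches:
```python
import math

# Map tool names to actual implementations.
def calculate_circle_area(radius: float) -> float:
    return math.pi * radius ** 2

TOOL_REGISTRY = {"calculate_circle_area": calculate_circle_area}

call, error = parse_tool_call(response, TOOL_REGISTRY)
if error:
    print(f"Error: {error}")
else:
    # Unpack the JSON arguments directly into the function's parameters.
    result = TOOL_REGISTRY[call["name"]](**call["arguments"])
    print(f"Tool result: {result}")  # feed this back into the conversation
```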
7. **Alternative: GGUF Deployment**
- For local/edge deployment, download GGUF versions from mradermacher repositories
- Standard GGUF: Broad compatibility, general testing
- Imatrix GGUF: Higher quality for low-VRAM devices (recommended)
- Use with llama.cpp, Ollama, or LM Studio for quantized inference
- Select quantization level based on available memory (Q4_K_M recommended baseline)
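For GGUF deployment, one option is the `llama-cpp-python` bindings for llama.cpp. The model path below is a placeholder for whichever quantization you download, and the prompt mirrors the ChatML format from step 3:
```python
from llama_cpp import Llama

# The model path is a placeholder; point it at the GGUF file you downloaded.
llm = Llama(
    model_path="LFM2.5-1.2B-Nova-Function-Calling.Q4_K_M.gguf",
    n_ctx=4096,
)

prompt = (
    "<|im_start|>user\n"
    "I need to calculate the area of a circle with a radius of 5.\n<|im_end|>\n"
    "<|im_start|>assistant"
)

# Plain completion call; stop at the end-of-turn tag so only the tool call is returned.
output = llm(prompt, max_tokens=128, stop=["<|im_end|>"])
print(output["choices"][0]["text"])
```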
**Input Query:**
```
I need to calculate the area of a circle with a radius of 5.
```
**Expected Output:**
```json
<tool_call>
{"name": "calculate_circle_area", "arguments": {"radius": 5}}
</tool_call>
```