Skip to content

Due to the instability of OpenAI services, I plan not to renew my subscription. I will look for other alternatives, such as using NotionAI for document processing, GitHub Copilot for coding, and for some translation and text processing tasks, I choose Ollama.

Ollama Local Large Models#

Ollama is an open-source framework for running large language models (LLMs) locally. It provides developers and researchers with an integrated platform to conveniently build, train, and share their language models.

For more details, refer to the documentation on Github:

https://github.com/ollama/ollama

How to Use Ollama#

Using Ollama is very simple, just a few steps:

  1. Install Ollama by visiting the official Ollama website: https://ollama.com/.

  2. Download the language model you want to use. (Run the command from the Download column in the table below in the terminal)

    https://blog-1259751088.cos.ap-shanghai.myqcloud.com/uPic/CleanShot 2024-03-09 at 19.31.22.png

  3. Use the Ollama API to load the language model.

  4. Call the language model API for predictions.

Models Supported by Ollama#

All the models below can be downloaded. Copy the “Download Command” and run it in the terminal.

Official model list: https://ollama.com/library

ModelParametersSizeDownload CommandUpdate Date
Qwen27B4.4GBollama run qwen2:7b2024/06/07
codestral22B12 GBollama run codestral2024/06/05
Mistral.37B4.1GBollama run mistral:v0.32024/06/05
Yi<9b>9B5.0GBollama run yi:9b2024/06/05
Llama 3.18B4.7GBollama run llama3.12024/07/25
Llama 3.170B40GBollama run llama3.1:70b2024/07/25
Llama 3.1405B231GBollama run llama3.1:405b2024/07/25
Phi 3 Mini3.8B2.3GBollama run phi32024/06/28
Phi 3 Medium14B7.9GBollama run phi3:medium2024/06/28
Gemma 29B5.5GBollama run gemma22024/06/28
Gemma 227B16GBollama run gemma2:27b2024/06/28
Mistral7B4.1GBollama run mistral2024/04/25
Moondream 21.4B829MBollama run moondream2024/06/28
Neural Chat7B4.1GBollama run neural-chat2024/04/25
Starling7B4.1GBollama run starling-lm2024/04/25
Code Llama7B3.8GBollama run codellama2024/04/25
Llama 2 Uncensored7B3.8GBollama run llama2-uncensored2024/04/25
LLaVA7B4.5GBollama run llava2024/04/25
Solar10.7B6.1GBollama run solar2024/04/25

To delete a model, copy the command below and replace “llama2” with the model you want to remove.

ollama rm llama2

Updated 2024/06/07

Finding More Models#

Updated 2024/04/28

💡 Artificial Analysis

Updated with the scoring website recommended by Professor Andrew Ng!

Model & API Providers Analysis | Artificial Analysis

💡 Brother Lin’s Wild Large Model Rankings

A ranking of large model products more suitable for the Chinese audience.

林哥的大模型野榜

💡 Open LLM Leaderboard

Hugging Face is a popular community for open-source models. This is the leaderboard data officially maintained by them.

Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4

💡 SuperCLUE Overall Rankings

SuperCLUE is a comprehensive evaluation benchmark for Chinese general large models, assessing model capabilities from three different dimensions: basic ability, professional ability, and Chinese-specific ability.

SuperCLUE

💡 MMLU Massive Multitask Language Understanding Benchmark

MMLU stands for Massive Multitask Language Understanding. It is an evaluation of large models’ language understanding capabilities and is one of the most famous large model semantic understanding evaluations, developed by researchers at UC Berkeley.

Papers with Code - MMLU Benchmark (Multi-task Language Understanding)

💡 LLMEval

LLMEval is a large model evaluation benchmark launched by the NLP Lab of Fudan University. The latest LLMEval-3 focuses on evaluating professional knowledge capabilities, covering 13 academic disciplines and over 50 sub-disciplines as defined by the Ministry of Education, including philosophy, economics, law, education, literature, history, science, engineering, agriculture, medicine, military science, management, and arts, totaling about 200,000 standard generative Q&A questions.

LLM-EVAL

Simple Test#

Dialogue effect comparison:

https://blog-1259751088.cos.ap-shanghai.myqcloud.com/uPic/CleanShot 2024-03-09 at 20.03.28.png

Supported Clients#

Cherry Studio#

A more modern LLM client supporting major service providers both domestic and international. It features a smooth interface and integrates services like translation, drawing, and an agent store.

Github address:

https://github.com/kangfenmao/cherry-studio

https://blog-1259751088.cos.ap-shanghai.myqcloud.com/uPic/CleanShot%202024-11-01%20at%2015.35.38@2x.png

Opencat#

Added support for local models in version 2.8.

Go to settings to configure the URL http://localhost<11434>

It’s ready when verification is successful.

Click the avatar in the chat interface to select the model:

NotesOllama#

Allows your Apple Notes to interact using Ollama’s local LLM.

https://github.com/andersrex/notesollama

Select the text you want to interact with in the notebook, and an interaction menu will appear in the bottom right corner of the note.

Lobe Chat#

https://github.com/lobehub/lobe-chat

This is a cross-platform client supporting multiple languages, a plugin system, and self-deployment.

In the new version, it already supports local calls to Ollama. Here is the tutorial page:

Using Ollama in LobeChat · LobeChat Docs · LobeHub

UI preview:

BMO Chatbot#

An AI plugin for Obsidian that uses large language models (LLMs) like Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, etc., to generate and brainstorm ideas for you while recording your thoughts in Obsidian.

Github address:

https://github.com/longy2k/obsidian-bmo-chatbot

You can also search and install it in the Obsidian plugin store.

💡 Thank you for reading! Feel free to share the article or contact me to exchange ideas.

The RSS address of this site has been updated. Please resubscribe, dear readers!