OceanGPT (沧渊): Ocean Foundation Model

Project | Paper | Models | Web | Overview | Quickstart | Citation

License: MIT

Table of Contents

  • 🔔News
  • Models
  • Instruction Data
  • 🌟Overview
  • ⏩Quickstart
  • 🤗Chat with Our Demo on Gradio
  • 📌Inference
  • 🌻Acknowledgement
  • Limitations
  • 🚩Citation

🔔News

  • 2025-04-20, we release OceanGPT-o-7B and OceanGPT-coder-7B.
  • 2025-02-01, we collect sonar data for model training and test OceanGPT-coder.
  • 2024-12-01, we collect more publicly available sonar data and scientific images for model training.
  • 2024-08-01, we launch OceanGPT-o, a bilingual (Chinese-English) multimodal large language model, together with sonar and ocean science image data collection and training.
  • 2024-07-04, we release OceanGPT-basic-14B/2B and the updated OceanGPT-basic-7B (v0.2).
  • 2024-06-04, OceanGPT is accepted by ACL 2024. 🎉🎉
  • 2023-10-04, we release the paper "OceanGPT: A Large Language Model for Ocean Science Tasks" and OceanGPT-basic-7B (v0.1) based on LLaMA2.
  • 2023-05-01, we launch the OceanGPT (沧渊) project.

Models

| Model Name | ModelScope | HuggingFace |
| --- | --- | --- |
| OceanGPT-o-7B (based on Qwen, recommended) | 7B | 7B |
| OceanGPT-coder-7B (based on Qwen, recommended) | To be released | To be released |
| OceanGPT-basic-8B (based on Qwen, recommended) | To be released | To be released |
| OceanGPT-basic-14B (based on Qwen, legacy) | 14B | 14B |
| OceanGPT-basic-7B (based on Qwen, legacy) | 7B | 7B |
| OceanGPT-basic-2B (based on MiniCPM, legacy) | 2B | 2B |

  • Please note that the ocean-domain Q&A in the online demo system (including the video) uses knowledge-base augmentation and a "general-specialized integration" approach, so its output differs from that of the open-source models!
  • Due to limited computing resources, OceanGPT-o is currently applicable only to natural language interpretation and generation for certain types of sonar images and marine science images. A GPU with at least 24 GB of memory is recommended.

Instruction Data

| Data Name | HuggingFace | ModelScope |
| --- | --- | --- |
| OceanInstruct | 50K | 50K |
| OceanInstruct-o | 50K | 50K |

  • Some of the instruction data are synthetic; we apologize for any inaccuracies that may exist!

🌟Overview

This is the OceanGPT (沧渊) project, which aims to build an ocean foundation model.

  • Disclaimer: This project is purely an academic exploration, not a product. Please be aware that, due to the inherent limitations of large language models, issues such as hallucination may occur.

⏩Quickstart

conda create -n py3.11 python=3.11
conda activate py3.11
pip install -r requirements.txt

Download the model

Download from HuggingFace

git lfs install
git clone https://huggingface.co/zjunlp/OceanGPT-14B-v0.1

or

huggingface-cli download --resume-download zjunlp/OceanGPT-14B-v0.1 --local-dir OceanGPT-14B-v0.1 --local-dir-use-symlinks False
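
Alternatively, here is a minimal Python sketch using the huggingface_hub library (assuming huggingface_hub is installed; it downloads the same repo as the CLI command above):

# Programmatic download via huggingface_hub (a sketch, equivalent to the CLI above).
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="zjunlp/OceanGPT-14B-v0.1",  # same repo id as the git/CLI examples
    local_dir="OceanGPT-14B-v0.1",       # local target folder
)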

Download from WiseModel

git lfs install
git clone https://www.wisemodel.cn/zjunlp/OceanGPT-14B-v0.1.git

Download from ModelScope

git lfs install
git clone https://www.modelscope.cn/ZJUNLP/OceanGPT-14B-v0.1.git
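
Likewise, a minimal Python sketch using the modelscope library (assuming modelscope is installed; the returned path points at the local cache):

# Programmatic download via modelscope (a sketch, equivalent to the git clone above).
from modelscope import snapshot_download

model_dir = snapshot_download('ZJUNLP/OceanGPT-14B-v0.1')
print(model_dir)  # local path of the downloaded model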

Inference

Inference with HuggingFace Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

device = "cuda" # the device to load the model onto
path = 'YOUR-MODEL-PATH'

model = AutoModelForCausalLM.from_pretrained(
    path,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(path)

prompt = "Which is the largest ocean in the world?"
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(device)

generated_ids = model.generate(
    model_inputs.input_ids,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Inference with vLLM

from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

path = 'YOUR-MODEL-PATH'

tokenizer = AutoTokenizer.from_pretrained(path)

prompt = "Which is the largest ocean in the world?"
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

sampling_params = SamplingParams(temperature=0.8, top_k=50)
llm = LLM(model=path)

outputs = llm.generate([text], sampling_params)
print(outputs[0].outputs[0].text)

🤗Chat with Our Demo on Gradio

Online Demo

We provide users with an interactive Gradio demo accessible online.

Local WebUI Demo

You can easily deploy the interactive interface locally using the code we provide.

python app.py

Open http://localhost:7860 in your browser and enjoy interacting with OceanGPT.
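
For readers who want to see roughly what such a WebUI involves, below is a minimal sketch. It is not the repo's app.py; it assumes gradio is installed, Gradio's tuple-style chat history, and the HuggingFace model loaded as in the Quickstart:

# Minimal Gradio chat sketch (hypothetical; the repo's app.py is the supported entry point).
import torch
import gradio as gr
from transformers import AutoModelForCausalLM, AutoTokenizer

path = "YOUR-MODEL-PATH"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(
    path, torch_dtype=torch.bfloat16, device_map="auto"
)

def chat(message, history):
    # Rebuild the conversation in chat-template format from Gradio's (user, assistant) pairs.
    messages = [{"role": "system", "content": "You are a helpful assistant."}]
    for user_msg, assistant_msg in history:
        messages.append({"role": "user", "content": user_msg})
        messages.append({"role": "assistant", "content": assistant_msg})
    messages.append({"role": "user", "content": message})
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer([text], return_tensors="pt").to(model.device)
    output_ids = model.generate(inputs.input_ids, max_new_tokens=512)
    # Strip the prompt tokens and decode only the newly generated ones.
    return tokenizer.decode(output_ids[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)

gr.ChatInterface(chat).launch(server_port=7860)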

📌Inference

Efficient Inference with llama.cpp, ollama, vLLM

llama.cpp now officially supports models based on Qwen2.5; the HuggingFace checkpoints can be converted to GGUF as follows.

Download the OceanGPT PyTorch model from HuggingFace into an "OceanGPT" folder.

Clone and build llama.cpp:

git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
make llama-cli

Then convert the PyTorch model to a GGUF file:

python convert-hf-to-gguf.py OceanGPT --outfile OceanGPT.gguf

Running the model:

./llama-cli -m OceanGPT.gguf \
    -co -cnv -p "Your prompt" \
    -fa -ngl 80 -n 512

ollama now officially supports models based on Qwen2.5.

Create a file named Modelfile:

FROM ./OceanGPT.gguf
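# Note: [INST] is a Llama-style template; Qwen-based models typically use the ChatML format, so you may need to adjust TEMPLATE if responses look malformed.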
TEMPLATE "[INST] {{ .Prompt }} [/INST]"

Create the model in Ollama:

ollama create example -f Modelfile

Running the model:

ollama run example "What is your favourite condiment?"

vLLM now officially supports models based on Qwen2.5-VL and Qwen2.5.
  1. Install vLLM (>= 0.7.3):

pip install vllm

  2. Run the example:
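
A minimal sketch, assuming the model is served with vllm serve YOUR-MODEL-PATH (an OpenAI-compatible server, listening on port 8000 by default); the endpoint, api_key, and model name below are assumptions based on vLLM's server defaults, not values from this repo:

# Query a model served by `vllm serve YOUR-MODEL-PATH` (a sketch; endpoint and
# model name are assumptions based on vLLM's OpenAI-compatible server defaults).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="YOUR-MODEL-PATH",  # must match the name the server was launched with
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Which is the largest ocean in the world?"},
    ],
    temperature=0.8,
)
print(response.choices[0].message.content)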

🌻Acknowledgement

OceanGPT (沧渊) is trained on top of open-source large language models, including Qwen, MiniCPM, and LLaMA.

OceanGPT is trained with open-source data and tools, including MOOS, UATD, the Forward-looking Sonar Detection Dataset, NKSID, SeabedObjects-KLSG, and Marine Debris.

Thanks for their great contributions!

Limitations

  • The model may have hallucination issues.

  • Due to limited computational resources, OceanGPT-o currently only supports natural language generation for certain types of sonar images and ocean science images. OceanGPT-coder currently only supports MOOS code generation.

  • We did not optimize the model's identity, so it may generate identity information similar to that of the Qwen/MiniCPM/LLaMA/GPT series models.

  • The model's output is sensitive to prompt tokens, which may lead to inconsistent results across multiple attempts.

🚩Citation

Please cite the following paper if you use OceanGPT in your work.

@article{bi2024oceangpt,
  title={OceanGPT: A Large Language Model for Ocean Science Tasks},
  author={Bi, Zhen and Zhang, Ningyu and Xue, Yida and Ou, Yixin and Ji, Daxiong and Zheng, Guozhou and Chen, Huajun},
  journal={arXiv preprint arXiv:2310.02031},
  year={2024}
}