CoSo

Source code for "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"

Github Data

Star 5
Fork 0
Subscriber 1
Language Python
Size 7.5K

created at 2025-05-01
updated at 2025-06-12
pushed at 2025-06-12

langfengQ

PhD @ NTU Singapore | My research focuses on reinforcement learning (RL), large language models (LLMs), LLM post-training, and LLM-based agents.
Singapore
Nanyang Technological University

Github Data

Followers 28
Following 8

Links

https://x.com/langfengq
https://langfengq.github.io/

AI Project

Public repos: 14
Public gists: 0

CoSo

Source code for "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"
star: 5
fork: 0
language: Python
created at: 2025-05-01
updated at: 2025-06-12

verl-agent

verl-agent is an extension of veRL designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
star: 409
fork: 26
language: Python
created at: 2025-03-23
updated at: 2025-06-24