5 20 136

Sambit Mukherjee

sadhaklal

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

liked a model about 2 months ago

phanerozoic/threshold-xnor

liked a model about 2 months ago

phanerozoic/threshold-nor

View all activity

Organizations

upvoted an article 3 days ago

Article

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

5 days ago

•

liked 6 models about 2 months ago

upvoted a collection about 2 months ago

Threshold Logic Circuits

Collection

Boolean gates, voting functions, modular arithmetic, and adders as threshold networks. • 269 items • Updated Jan 29 • 1

liked 2 models 5 months ago

Soul-AILab/SoulX-Podcast-1.7B

Text-to-Speech • Updated Dec 18, 2025 • 242 • 231

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 96.6k • 338

liked a model 7 months ago

google/gemma-3-12b-it

Image-Text-to-Text • Updated Mar 21, 2025 • 1.93M • • 679

liked 3 datasets 9 months ago

FanqingM/MMIU-Benchmark

Viewer • Updated Aug 8, 2024 • 11.7k • 1.66k • 11

HuggingFaceH4/llava-instruct-mix-vsft

Viewer • Updated Apr 11, 2024 • 273k • 1.73k • 48

theblackcat102/llava-instruct-mix

Viewer • Updated Oct 23, 2023 • 273k • 245 • 12

liked a model 9 months ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • Updated Sep 17, 2025 • 74k • 1.61k

liked a Space 9 months ago

SmolDocling

🦆

260

Convert document images into structured text and data

New activity in discord-community/LevelBot 12 months ago

Cannot verify my account in discord

➕ 1

#37 opened 12 months ago by

pdjota

commented on Tool Use, Unified 12 months ago

Excellent article. Very clearly written.

I have one question though. It seems that that model replies with either (i) a text response or (ii) a tool call. However, in the original ReAct paper, there is a "Thought" -> "Action" -> "Observation" cycle. In other words, in response to the user's query, the model first outputs a "Thought" followed by an "Action". How do I implement this (i.e., make the model "think" before performing a tool call)?

The following are the original ReAct prompts for HotpotQA (from the official ReAct GitHub repo): https://raw.githubusercontent.com/ysymyth/ReAct/refs/heads/master/prompts/prompts_naive.json

If you examine these prompts, you'll notice that the "thoughts" come before the "actions".

upvoted an article 12 months ago

Article

Tool Use, Unified

Aug 12, 2024

•

120

posted an update 12 months ago

Post

1995

What happens when you combine the Chain of Thought (CoT) reasoning capabilities of LLMs with a heuristic-guided tree search algorithm? In the Tree of Thoughts (ToT) paper, the authors (Yao et al.) have coupled GPT-4 with tree search algorithms to attack a few tasks on which left-to-right CoT struggles. And the results are impressive. For example, on the "Game of 24" task, while GPT-4 with CoT prompting only managed to solve 4% of tasks, ToT achieved a success rate of 74%.

I've written a blog post that makes the ToT paper easy to understand and implement by taking you through all the details in a step-by-step manner: https://huggingface.co/blog/sadhaklal/tree-of-thoughts

If you are interested in the topics of algorithmic AI, tree search, reasoning, planning, or "System 2" thinking, then you may find this blog post useful.

Sambit Mukherjee

AI & ML interests

Recent Activity

Organizations

sadhaklal's activity

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

SmolDocling

Cannot verify my account in discord

Tool Use, Unified