2 48 115

Joshua Chris

KrisKale45

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

jan-hq/Solar-10.7B-SLERP

liked a model 7 days ago

Menlo/Jan-nano

upvoted a paper 7 days ago

In-Video Instructions: Visual Signals as Generative Control

View all activity

Organizations

None yet

upvoted a paper 7 days ago

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published 22 days ago • 30

upvoted a paper 26 days ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published 27 days ago • 42

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted a paper 4 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 116

upvoted an article 4 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

•

509

upvoted 5 papers 5 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17 • 77

Replacing thinking with tool usage enables reasoning in small language models

Paper • 2507.05065 • Published Jul 7 • 15

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Paper • 2507.11061 • Published Jul 15 • 37

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published Jul 8 • 21

VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

Paper • 2507.04590 • Published Jul 7 • 16

upvoted an article 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

737

upvoted a paper 7 months ago

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 38

upvoted an article 7 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

•

238

upvoted a paper 7 months ago

PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Paper • 2505.04622 • Published May 7 • 27

upvoted an article 8 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted an article 9 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

•

477

upvoted 3 collections 11 months ago

upvoted an article 11 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

•

887

Joshua Chris

AI & ML interests

Recent Activity

Organizations

KrisKale45's activity

Welcome GPT OSS, the new open-source model family from OpenAI!

SmolLM3: smol, multilingual, long-context reasoner

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents: an MCP-powered agent in 50 lines of code

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Open-R1: a fully open reproduction of DeepSeek-R1