Hand-Object Interaction Prediction

Qualitative Results

We evaluate HandsOnVLM on unseen clips from several human video datasets. Each qualitative result below shows the context RGB video and the language description, followed by the predicted future hand locations. For quantitative evaluations, please refer to the main paper.
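As a rough illustration of what each example pairs together, the sketch below stores a language query with a short sequence of predicted future hand locations and scales them to pixel coordinates for overlay on a frame. The class name, field names, normalized-coordinate convention, and waypoint values are all assumptions made for illustration; they are not taken from the HandsOnVLM code or paper.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TrajectoryExample:
    """One qualitative example: a language query plus predicted hand locations.

    Hypothetical container, not the authors' data format.
    """
    dataset: str
    query: str
    # Assumed convention: (x, y) in [0, 1], relative to frame width and height.
    future_hand_xy: List[Tuple[float, float]]


def to_pixels(example: TrajectoryExample, width: int, height: int) -> List[Tuple[int, int]]:
    """Scale normalized waypoints to integer pixel coordinates."""
    return [(round(x * width), round(y * height)) for x, y in example.future_hand_xy]


example = TrajectoryExample(
    dataset="Ego4D",
    query="Can you provide the hand trajectory for positioning the dough on the roller?",
    # Made-up waypoints, for illustration only.
    future_hand_xy=[(0.5, 0.75), (0.5625, 0.625), (0.625, 0.5)],
)

pixels = to_pixels(example, width=640, height=480)
print(pixels)
```

The resulting pixel coordinates could then be drawn over the last context frame to visualize a trajectory in the style of the examples below.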

Epic Kitchen Dataset

How should my hand move if I want to grasp the flat, rectangular gray object sitting on the stovetop?

How should my hand move if I want to transfer the pizza from its parchment paper to a decorative dish?

What is the suggested hand movement for lifting the thin bottle from the wooden surface?

Where should my hand move to if I want to remove the cork from the top of the dark-colored bottle?

Where should my hand move to if I want to access the contents inside the wooden cabinet or drawer?

Where should my hand move to if I want to gently return the container of light brown and white fungi to its original spot?

FPHA Dataset

What is the hand trajectory for flipping the sponge?

What is the hand trajectory for cleaning the glasses?

What is the recommended hand movement for pouring from the juice bottle?

Can you provide the hand trajectory for stirring?

Where should my hand move to if I want to tear paper?

Ego4D Dataset

Where should my hand move to if I want to clear the bubbles from the top of the soapy water?

Where should my hand move to if I want to hang the garment on the rack for drying?

Where should my hand move to if I want to direct a stream of water onto the soil?

Can you provide the hand trajectory for grasping a shallow dish typically used in scientific experiments?

What is the recommended hand movement for applying ink to a surface in a creative setting?

Can you provide the hand trajectory for positioning the dough on the roller?