• English
  • Hindi
  • Punjabi
  • Marathi
  • German
  • Gujarati
  • Urdu
  • Telugu
  • Bengali
  • Kannada
  • Odia
  • Assamese
  • Nepali
  • Spanish
  • French
  • Japanese
  • Arabic
  • Home
  • Noida
  • National
    • BulletsIn
    • cliQ Explainer
    • Government Policy
    • New India
  • International
    • Middle East
    • Foreign
  • Entertainment
  • Business
    • Tender News
  • Sports
    • IPL2025
  • Services
    • Lifestyle
    • How To
    • Spiritual
      • Festival and Culture
    • Tech
Notification
  • Home
  • Noida
  • National
    • BulletsIn
    • cliQ Explainer
    • Government Policy
    • New India
  • International
    • Middle East
    • Foreign
  • Entertainment
  • Business
    • Tender News
  • Sports
    • IPL2025
  • Services
    • Lifestyle
    • How To
    • Spiritual
      • Festival and Culture
    • Tech
  • Home
  • Noida
  • National
    • BulletsIn
    • cliQ Explainer
    • Government Policy
    • New India
  • International
    • Middle East
    • Foreign
  • Entertainment
  • Business
    • Tender News
  • Sports
    • IPL2025
  • Services
    • Lifestyle
    • How To
    • Spiritual
      • Festival and Culture
    • Tech
  • Noida
  • National
  • International
  • Entertainment
  • Business
  • Sports
CliQ INDIA > International > Foreign > OmniMotion Allows for Better Video Motion Estimation
ForeignInternational

OmniMotion Allows for Better Video Motion Estimation

cliQ India
cliQ India
Share
7 Min Read
SHARE

Newswise — ITHACA, N.Y. — Cornell researchers have developed a new optimization tool to estimate motion throughout an input video, which has potential applications in video editing and generative AI video creation.

The tool, called OmniMotion, is described in a paper, “Tracking Everything, Everywhere, All at Once,” presented at the International Conference on Computer Vision, Oct. 2-6 in Paris.

“There are these two dominant paradigms in motion estimation – optical flow, which is dense but short range, and feature tracking, which is sparse but long range,” said Noah Snavely, associate professor of computer science at Cornell Tech and in the Cornell Ann S. Bowers College of Computing and Information Science. “Our method allows us to have both dense and long-range tracking across time.”

OmniMotion uses what the researchers term “a quasi-3D representation” – a relaxed form of 3D that retains important properties (such as tracking pixels when they pass behind other objects) without the challenges of dynamic 3D reconstruction.

“We found a way to basically have it estimate more qualitative 3D,” Snavely said. “It’s saying, ‘I don’t know exactly where these two objects are in 3D space, but I know that this one is in front of that one.’ You can’t look at it as a 3D model, as things will be distorted, but it captures the ordering relationships between objects.”

The new method takes a small sample of frames and motion estimates to create a complete motion representation for the entire video. Once optimized, the representation can be queried with any pixel in any frame to produce a smooth, accurate motion trajectory across the full video.

This would be useful, Snavely said, when incorporating computer-generated imagery, or CGI, into video-editing.

“If I want to place an object – say a sticker – on a video, then I need to know where it should be in every frame,” he said. “So I place it in the first frame of the video; to avoid having to edit every subsequent frame in a painstaking way, it’d be nice if I could just track where it should be in every frame – as well as if it shouldn’t be there, if there’s something occluding it.”

OmniMotion could also help inform algorithms in generative text-to-video applications, Snavely said.

“Often these text-to-video models aren’t very coherent,” he said. “Objects will change size over the course of the video, or people move in uncanny ways, and that’s because they’re just generating the raw pixels of a video. They don’t have any notion of the underlying dynamics that would result in pixel motion.

“We’re hoping that by providing algorithms for estimating motion in videos, we can  help improve the motion coherence of generated videos,” he said.

Qianqian Wang, Ph.D. ’23, a postdoctoral researcher at the University of California, Berkeley, and a research scientist at Google Research, was lead author. Other co-authors were Bharath Hariharan, assistant professor of computer science at Cornell Bowers CIS; doctoral students Yen-Yu Chang and Ruojin Cai; and Aleksander Holynski, postdoctoral researcher at Berkeley and a scientist at Google Research; and Zhengqi Li of Google Research.

Also at the conference, Cai presented “Doppelgangers: Learning to Disambiguate Images of Similar Structures,” which uses a massive dataset of image pairs to train computer vision applications to distinguish between images that look the same but are not, like different sides of a clock tower or building.

For Doppelgangers, Snavely and his team show how to use existing image annotations stored in the Wikimedia Commons image database to automatically create a large set of labeled image pairs of 3D surfaces.

Doppelgangers comprises a collection of internet photos of landmarks and cultural sites that exhibit repeated patterns and symmetrical structures. The dataset includes a large number of image pairs – each labeled as either positive or negative matching pairs.

“Big Ben or the Eiffel Tower – they kind of look the same from different sides,” Snavely said. “Computer vision just isn’t good enough to tell the sides apart. So we invented a method to help tell when two things look similar but are different, and when two things really are the same.”

In Doppelgangers, a neural network is trained to assess the spatial distribution of key points in an image, to differentiate between pairs of images that look similar but are different — like two different faces of Big Ben — from images of actual identical scene content. This would be useful in 3D reconstruction technology, Snavely said.

“The network likely learns things like whether the backgrounds are the same or different, or if there are other details that differentiate them,” he said. “Then it outputs a probability: Are these really matching, or do they just look like they’re matching? Then we can integrate that with 3D reconstruction pipelines to make better models.”

Cai was lead author on Doppelgangers; in addition to Wang and Hariharan, co-authors were Joseph Tung ’24, a computer science major; and Hadar Averbuch-Elor, assistant professor of electrical engineering at Tel Aviv University.

Both papers were supported in part by grants from the National Science Foundation. In addition to the NSF, the OmniMotion research was supported by an NVIDIA academic hardware grant, and a Google Ph.D. Fellowship to Wang.


http%3A%2F%2Fwww.newswise.com%2Farticles%2Fview%2F800565%2F%3Fsc%3Drsla

You Might Also Like

New documentary decodes 40-year-long mystery of Indian diplomat's killing in UK, unmasks his killer
India hosts first-ever BIMSTEC youth summit in Gandhinagar, focusing on regional cooperation
India’s trade deficit widens sharply in March amid rising oil and gold imports | cliQ Latest
Indian envoy conveys PM Modi's congratulatory message to newly elected Maldivian President Muizzu
Iranian nationals protest outside World Economic Forum venue in Davos

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Whatsapp Whatsapp Telegram Copy Link Print
Share
What do you think?
Love0
Sad0
Happy0
Angry0
Wink0
Previous Article Shweta Bachchan, Navya wish Amitabh on his 81st birthday
Next Article US Secretary of State Antony Blinken will travel to Israel
Leave a Comment Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

Stay Connected

FacebookLike
XFollow
InstagramFollow
YoutubeSubscribe
TelegramFollow
- Advertisement -
Ad imageAd image

Latest News

Bengal Falta Repoll 2026: Massive Security Deployment After Election Controversy | Cliq Latest
National
May 21, 2026
Peddi Promotion Event In Bhopal: Ram Charan And AR Rahman Ready For Mega Show | Cliq Latest
Entertainment
May 21, 2026
Junior NTR Dragon Teaser Out: NTR Stuns Fans With Intense Assassin Avatar | Cliq Latest
Entertainment
May 21, 2026
KKR Vs MI IPL 2026: Manish Pandey And Bowlers Revive Kolkata Playoff Dream | Cliq Latest
Sports
May 21, 2026

//

We are rapidly growing digital news startup that is dedicated to providing reliable, unbiased, and real-time news to our audience.

We are rapidly growing digital news startup that is dedicated to providing reliable, unbiased, and real-time news to our audience.

Sign Up for Our Newsletter

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

Follow US

Follow US

© 2026 cliQ India. All Rights Reserved.

CliQ INDIA
  • English – अंग्रेज़ी
  • Hindi – हिंदी
  • Punjabi – ਪੰਜਾਬੀ
  • Marathi – मराठी
  • German – Deutsch
  • Gujarati – ગુજરાતી
  • Urdu – اردو
  • Telugu – తెలుగు
  • Bengali – বাংলা
  • Kannada – ಕನ್ನಡ
  • Odia – ଓଡିଆ
  • Assamese – অসমীয়া
  • Nepali – नेपाली
  • Spanish – Española
  • French – Français
  • Japanese – フランス語
  • Arabic – فرنسي
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?