GeekBlog

    New AI model turns photos into explorable 3D worlds, with caveats

    By Michael Comaous, September 4, 2025
    A still shot of a 3D scene rendered using Voyager.

    Training with automated data pipeline

    Voyager builds on Tencent’s earlier HunyuanWorld 1.0, released in July. Voyager is also part of Tencent’s broader “Hunyuan” ecosystem, which includes the Hunyuan3D-2 model for text-to-3D generation and the previously covered HunyuanVideo for video synthesis.

    To train Voyager, researchers developed software that automatically analyzes existing videos to process camera movements and calculate depth for every frame, eliminating the need for humans to manually label thousands of hours of footage. The system processed over 100,000 video clips from both real-world recordings and Unreal Engine renders.

    A diagram of the Voyager world creation pipeline. Credit: Tencent

    The model demands serious computing power to run, requiring at least 60GB of GPU memory for 540p resolution, though Tencent recommends 80GB for better results. Tencent published the model weights on Hugging Face and included code that works with both single and multi-GPU setups.
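
    As a rough illustration of those thresholds, the Python sketch below classifies a GPU by its memory capacity. The 60GB and 80GB figures come from the requirements stated above; the function name and tier labels are hypothetical and are not part of Tencent's released code.

```python
# Thresholds taken from the article: 60GB minimum at 540p, 80GB recommended.
MIN_GB = 60
RECOMMENDED_GB = 80


def memory_tier(total_gb: float) -> str:
    """Classify a GPU's memory against the stated requirements.

    The tier names are illustrative labels, not official terminology.
    """
    if total_gb >= RECOMMENDED_GB:
        return "recommended"
    if total_gb >= MIN_GB:
        return "minimum"
    return "insufficient"


# Example capacities in GB for a few hypothetical cards.
for capacity in (24, 64, 80):
    print(capacity, memory_tier(capacity))
```

    Under these numbers, a 24GB card falls short of the minimum, which is why the requirement puts the model out of reach for most consumer hardware.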

    The model comes with notable licensing restrictions. Like other Hunyuan models from Tencent, the license prohibits usage in the European Union, the United Kingdom, and South Korea. Additionally, commercial deployments serving over 100 million monthly active users require separate licensing from Tencent.

    On the WorldScore benchmark developed by Stanford University researchers, Voyager reportedly achieved the highest overall score of 77.62, compared to 72.69 for WonderWorld and 62.15 for CogVideoX-I2V. The model reportedly excelled in object control (66.92), style consistency (84.89), and subjective quality (71.09), though it placed second in camera control (85.95) behind WonderWorld’s 92.98. WorldScore evaluates world generation approaches across multiple criteria, including 3D consistency and content alignment.

    While these self-reported benchmark results seem promising, wider deployment still faces challenges due to the computational muscle involved. For developers needing faster processing, the system supports parallel inference across multiple GPUs using the xDiT framework. Running on eight GPUs delivers processing speeds 6.69 times faster than single-GPU setups.
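
    To put that speedup in context, the short Python calculation below divides the reported 6.69x figure by the eight GPUs used, which gives the parallel scaling efficiency. This is an illustrative sketch rather than anything from Tencent's materials, and the function name is hypothetical.

```python
def parallel_efficiency(speedup: float, n_gpus: int) -> float:
    """Return the fraction of ideal linear scaling that was achieved."""
    return speedup / n_gpus


# Figures reported in the article: 6.69x faster on eight GPUs.
eff = parallel_efficiency(6.69, 8)
print(f"{eff:.1%}")  # prints 83.6%
```

    By this measure, the eight-GPU setup reaches roughly 84 percent of what perfect linear scaling would deliver.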

    Given the processing power required and the limitations in generating long, coherent “worlds,” it may be a while before we see real-time interactive experiences using a similar technique. But as we’ve seen so far with experiments like Google’s Genie, we’re potentially witnessing very early steps into a new interactive, generative art form.
