Skywork UniPic 2.0 Goes Open-Source: A Leap Forward in Unified Multimodal AI


News provided by

Skywork AI pte ltd

Aug 13, 2025, 04:24 ET

SINGAPORE, Aug. 13, 2025 /PRNewswire/ -- The SkyWork AI Technology Release Week officially kicked off on August 11. From August 11 to August 15, Skywork is releasing one new model each day, covering cutting-edge models for core multimodal AI scenarios. Skywork has already launched the SkyReels-A3, Matrix-Game 2.0, and Matrix-3D models.

Diagram: Core modules of Skywork UniPic 2.0

On August 13, the Skywork UniPic 2.0 model was officially open-sourced. It is an efficient training and inference framework for unified multimodal modeling, designed with lightweight generation and editing modules while integrating multimodal understanding models for joint training. This equips it with unified core capabilities—understanding, image generation, and editing—with the goal of achieving an "efficient, high-quality, and unified" multimodal generative model.

Skywork UniPic 2.0 and its model series are now fully open-source, with model weights, inference code, and optimization strategies released so that developers and researchers can rapidly deploy and build multimodal applications.

Project homepage:

https://unipic-v2.github.io/

Technical report:

https://github.com/SkyworkAI/UniPic/blob/main/UniPic-2/assets/pdf/UNIPIC2.pdf

GitHub:

https://github.com/SkyworkAI/UniPic/tree/main/UniPic-2

HuggingFace Gradio:

https://huggingface.co/spaces/Skywork/UniPic2-Metaquery

HuggingFace Model:

https://huggingface.co/Skywork/UniPic2-SD3.5M-Kontext-2B; https://huggingface.co/Skywork/UniPic2-Metaquery-9B
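For readers who want to fetch the released weights programmatically, the following is a minimal sketch rather than an official quick-start. It assumes the third-party `huggingface_hub` package is installed; the helper names `repo_id_from_url` and `download_weights` are hypothetical and do not come from the Skywork repository, whose own inference code should be treated as the reference.

```python
from urllib.parse import urlparse

# Model pages listed in this release.
MODEL_URLS = [
    "https://huggingface.co/Skywork/UniPic2-SD3.5M-Kontext-2B",
    "https://huggingface.co/Skywork/UniPic2-Metaquery-9B",
]


def repo_id_from_url(url: str) -> str:
    """Turn a Hugging Face model URL into an 'owner/name' repo ID."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"not a model URL: {url}")
    return "/".join(parts[:2])


def download_weights(url: str, local_dir: str) -> str:
    """Download a model snapshot and return its local path.

    Assumption: huggingface_hub is installed (pip install huggingface_hub).
    """
    from huggingface_hub import snapshot_download  # imported lazily

    return snapshot_download(repo_id=repo_id_from_url(url), local_dir=local_dir)


if __name__ == "__main__":
    # Only prints the repo IDs; call download_weights() to fetch files.
    for url in MODEL_URLS:
        print(repo_id_from_url(url))
```

Calling `download_weights(MODEL_URLS[0], "./unipic2-kontext")` would place the snapshot in a local directory of your choosing; the directory name here is only an example.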

Skywork UniPic 2.0 consists of three core modules:

Image generation & editing: Based on the SD3.5-Medium architecture, the originally text-only model has been upgraded to process both text and image inputs simultaneously. Through training on high-quality image generation and editing datasets, its functionality has evolved from standalone image generation to integrated generation and editing capabilities.

Unified model capability: By freezing the image generation/editing module and leveraging a multimodal model (Qwen2.5-VL-7B) with a pre-trained connector, we have established integrated understanding/generation/editing capabilities. Through joint fine-tuning of both the connector and the image generation/editing module, a unified model capable of seamless understanding, generation, and editing has been achieved.

Post-training for image generation & editing: To boost overall performance, we have developed a Flow-GRPO-based progressive dual-task reinforcement strategy. This approach achieves collaborative optimization of generation and editing tasks without cross-interference, yielding performance gains beyond standard pre-training.

The upgraded Skywork UniPic 2.0 delivers the following key advantages:

Lightweight yet high-performance generation module:

Built on the 2B-parameter SD3.5-Medium architecture, our generation module surpasses competitors in both image generation and editing benchmarks, including models like Bagel (7B params), OmniGen2 (4B params), UniWorld-V1 (12B params), and Flux-Kontext.

Enhanced reinforcement learning capability:

Our groundbreaking Flow-GRPO-based progressive dual-task reinforcement strategy significantly enhances the model's ability to interpret complex instructions and maintain consistency across image generation and editing tasks, all while enabling collaborative optimization without cross-task interference.

Unified architecture with scalable adaptation:

The system features seamless end-to-end integration of the Kontext image generation/editing model with multimodal architectures. Through lightweight connector fine-tuning, users can rapidly deploy unified understanding-generation-editing models while further improving both generation and editing performance.

The UniPic2-SD3.5M-Kontext model achieves remarkable performance despite its compact 2B parameter size. In comprehensive benchmarks, it surpasses both Flux.dev (12B parameters) in image generation metrics and Flux-Kontext (12B parameters) in editing performance. Furthermore, it outperforms nearly all existing unified models, including UniWorld-V1 (19B parameters) and Bagel (14B parameters), across both generation and editing tasks.

When extended into the unified UniPic2-Metaquery architecture, the model demonstrates additional performance gains, showcasing exceptional scalability beyond its already impressive baseline capabilities.

Skywork UniPic 2.0's exceptional understanding, generation, and editing capabilities are powered by the Skywork team's groundbreaking optimizations across all training stages – from pre-training and co-training to post-training refinement.

Pre-Training (image generation/editing model)

We first trained SD3.5-Medium to synthesize images from both textual instructions and reference images while preserving its original architecture. The model takes text inputs (encoded into instruction representations via the text encoder) and reference images (compressed into latent variables by the VAE and projected as context tokens). These components are then concatenated with the target image's noise tokens into a unified sequence, where the model's inherent positional encoding maintains clear differentiation between reference and target tokens. This methodology retains SD3.5M's native structure while simultaneously enabling both text-to-image (T2I) generation and text-conditioned image editing (I2I).
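The sequence construction described above can be illustrated with a small sketch. It is not the released implementation: the `Token` class, the `build_sequence` function, and the token counts are hypothetical, and the assumption that positions are simply sequential is ours, since the release does not spell out how the positional encoding separates reference and target tokens.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Token:
    kind: str      # "text", "reference", or "target_noise"
    position: int  # index in the unified sequence


def build_sequence(n_text: int, n_reference: int, n_target: int) -> List[Token]:
    """Concatenate instruction, reference-image, and target-noise tokens.

    n_reference == 0 corresponds to text-to-image (T2I); n_reference > 0
    corresponds to text-conditioned editing (I2I).
    """
    kinds = (["text"] * n_text
             + ["reference"] * n_reference
             + ["target_noise"] * n_target)
    # Assumption: positions are sequential, so reference and target
    # tokens never share a position index.
    return [Token(kind, pos) for pos, kind in enumerate(kinds)]


t2i = build_sequence(n_text=4, n_reference=0, n_target=6)
i2i = build_sequence(n_text=4, n_reference=6, n_target=6)
print(len(t2i), len(i2i))  # 10 16
```

The point of the sketch is that the same sequence layout serves both tasks: dropping the reference tokens leaves a plain T2I input, so no architectural change is needed to switch between generation and editing.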

Joint-Training

Starting from our pre-trained image generation/editing model, we implement the Metaquery framework to achieve cross-modal alignment between Qwen2.5-VL (multimodal) and the image synthesis model, thereby creating a unified architecture. This integration is achieved through two key processes:

Connector pre-training

We substituted SD3.5M's original T5 text encoder with Qwen2.5-VL and a Connector, maintaining frozen weights in both Qwen2.5-VL and SD3.5M's DiT backbone. The Connector underwent pre-training on 100M+ curated image-generation samples to establish precise feature-space alignment between Qwen2.5-VL's transformed outputs (via the Connector) and SD3.5M's DiT input expectations.

Joint SFT training

Following connector pre-training, we replaced SD3.5M with the pre-trained UniPic2-SD3.5M-Kontext model (specialized in image generation/editing), then unfroze both the connector and UniPic2-SD3.5M-Kontext parameters. Using high-quality generation and editing datasets, we jointly trained the connector and Kontext model to achieve optimal unified performance. The resulting UniPic2-Metaquery model not only preserves the base multimodal model's comprehension capabilities but also exhibits superior generation and editing performance compared to the standalone Kontext model.
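The two joint-training stages differ mainly in which components receive gradient updates. The sketch below records that schedule as plain data; the component names and the `Stage` class are hypothetical. It also assumes Qwen2.5-VL stays frozen during joint SFT, which the text implies (it lists only the connector and the Kontext model as unfrozen) but does not state outright.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet

# Hypothetical component names for the three parts of the unified model.
COMPONENTS = ("qwen2_5_vl", "connector", "generator")


@dataclass(frozen=True)
class Stage:
    name: str
    generator: str             # which image model sits behind the connector
    trainable: FrozenSet[str]  # components that receive gradient updates

    def requires_grad(self) -> Dict[str, bool]:
        """Map each component to whether it is trained in this stage."""
        return {c: c in self.trainable for c in COMPONENTS}


STAGES = [
    # Stage 1: only the connector learns; Qwen2.5-VL and the DiT are frozen.
    Stage("connector_pretraining", "SD3.5M DiT", frozenset({"connector"})),
    # Stage 2: connector and Kontext model are trained jointly.
    # Assumption: Qwen2.5-VL remains frozen here.
    Stage("joint_sft", "UniPic2-SD3.5M-Kontext",
          frozenset({"connector", "generator"})),
]

for stage in STAGES:
    print(stage.name, stage.requires_grad())
```

In a real training script, a table like this would typically drive which parameters are handed to the optimizer in each stage.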

Post-training: Multi-task reinforcement learning for concurrent generation/editing enhancement

Traditional multi-task RL often faces performance trade-offs, where optimizing one task compromises another. To overcome this limitation, we pioneered a progressive Flow-GRPO-based dual-task reinforcement strategy that achieves breakthrough concurrent optimization of text-to-image generation and image editing within a unified architecture. This represents the first demonstrated instance of interference-free, synergistic task improvement in multimodal model development.
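As background, GRPO-style methods score a group of samples drawn for the same prompt and normalize each reward against the group's statistics. The sketch below shows only that generic group-relative advantage step. It is not Skywork's training code: the reward values are made up, the use of the population standard deviation is an assumption, and the release does not describe the reward models or how the progressive schedule orders the generation and editing tasks.

```python
from statistics import mean, pstdev
from typing import List


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize rewards within one group of samples for the same prompt.

    Each advantage is (reward - group mean) / (group std + eps), so samples
    better than their group average get a positive learning signal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population standard deviation
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical reward scores for four images sampled from one prompt.
rewards = [0.2, 0.5, 0.5, 0.8]
advantages = group_relative_advantages(rewards)
print([round(a, 3) for a in advantages])
```

Because advantages are computed relative to each group, rewards for generation prompts and editing prompts never need to share a scale, which is one plausible reason a group-relative method suits a dual-task setting.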

As a pioneer in AI technology, Skywork continues to redefine the frontiers of artificial intelligence. In recent months, we have open-sourced multiple state-of-the-art foundation models that established new industry standards, including SkyReels-V1: the first video generation model specialized for AI-driven short film production; SkyReels-V2: the world's first unlimited-duration cinematic generation model employing a diffusion-forcing framework; and SkyReels-A3: an audio-driven portrait video generation model.

In multimodal AI development, Skywork has introduced two groundbreaking advancements: (1) the Skywork-R1V series—a 38B-parameter multimodal reasoning model that effectively bridges textual and visual reasoning while matching the performance of significantly larger proprietary models, and (2) pioneering spatial intelligence systems including the Matrix-Game 2.0 interactive world model and Matrix-3D generative world model.

Explore more open-source models in the Skywork family:

https://huggingface.co/Skywork

SOURCE Skywork AI pte ltd
