AI Glossary

The Two Metadata Problem

The absence of a common metadata standard across AI generation tools. Each tool uses its own storage location, format, field names, and encoding — ComfyUI stores JSON in PNG chunks, Midjourney uses Discord message strings, DALL-E returns API response objects — making cross-tool search architecturally difficult.

The name refers to the fact that even a simple concept like "which model generated this image" has completely different representations across tools. ComfyUI records a file path to a checkpoint in a JSON workflow graph. Midjourney records a version flag (--v 6.1) in a text string. DALL-E records a model name in an API response. Same semantic meaning, entirely different structural representation.

The two metadata problem is not just about formats — it extends to metadata richness. ComfyUI provides over 100 fields per generation. Midjourney provides 5-10. DALL-E provides 3-5. A normalization pipeline must accommodate this asymmetry without losing the rich information from detailed sources or failing on sparse sources.

Related Guides

Related Terms

See AI Asset Management in Action

Numonic automatically captures provenance, preserves metadata, and makes every AI-generated asset searchable and reproducible.