10个令人惊叹的IC-LoRA用法

MODEL ZOO Nov 22, 2024

上下文LoRA(IC-LoRA) 对文本到图像模型进行微调,以生成具有可定制内在关系的图像集,可选择以另一组为条件,从而适应各种创作任务。

1、电影故事板生成

每个三幅图像序列都是使用上下文 LoRA 同时生成的。占位符角色名称在图像中唯一地引用角色的身份。

Prompt: “In this adventurous three-image sequence, [IMAGE1] Ethan, an intrepid archaeologist with a rugged appearance, uncovers an ancient map in a sunlit desert dig site, his excitement palpable as he brushes away the sand, [IMAGE2] transitioning to a bustling marketplace in a vibrant foreign city where Ethan negotiates with local merchants and gathers essential supplies for his quest, [IMAGE3] and finally, Ethan treks through a dense, mist-covered jungle, the towering trees and exotic wildlife emphasizing the challenges and mysteries that lie ahead on his journey.”
提示:“在这个充满冒险的三幅图像序列中,[图像 1] 伊桑,一位外表粗犷的无畏考古学家,在阳光明媚的沙漠挖掘现场发现了一张古老的地图,当他拂去沙子时,他的兴奋之情溢于言表;[图像 2] 镜头转到一个充满活力的外国城市的熙熙攘攘的市场,伊桑在那里与当地商人谈判,为他的探索收集必需品;[图像 3] 最后,伊桑穿越浓密、雾气弥漫的丛林,高耸的树木和奇异的野生动物强调了他旅途中面临的挑战和谜团。”

2、肖像摄影

每组四幅图像都是使用 In-Context LoRA 同时生成的,旨在保持每组图像中主题身份的一致性。

Prompt: “This set of four images showcases a teenage girl with curly black hair wearing a stylish denim jacket, each image highlighting her dynamic personality in urban settings; [IMAGE1] she is skateboarding down a graffiti-covered alley, a confident smile on her face as she maneuvers around obstacles; [IMAGE2] she is seated at a trendy café, typing on her laptop with focused determination, the bustling city life visible through the large windows behind her; [IMAGE3] she stands on a rooftop at sunset, her hair blowing in the breeze as she gazes thoughtfully over the city skyline; and [IMAGE4] she is laughing with friends at a vibrant street market, colorful lights and stalls creating a lively atmosphere around her.”
提示:“这组四幅图像展示了一个身穿时尚牛仔夹克、有着一头黑色卷发的少女,每幅图像都凸显了她在城市环境中的活力个性;[图片 1] 她正在一条涂鸦覆盖的小巷里玩滑板,脸上带着自信的微笑,绕过障碍物;[图片 2] 她坐在一家时髦的咖啡馆里,专注而坚定地在笔记本电脑上打字,透过她身后的大窗户可以看到熙熙攘攘的城市生活;[图片 3] 日落时分,她站在屋顶上,头发在微风中飘扬,若有所思地凝视着城市的天际线; [图片 4] 她正和朋友们在热闹的街市上欢声笑语,五颜六色的灯光和摊位营造出一种热闹的氛围。”

3、字体设计

每组四幅图像均与 In-Context LoRA 同时生成,旨在实现每组图像之间字体样式的一致性。

Prompt: “The set of four images features a minimalist handwriting font for casual use. [IMAGE1] shows "Everyday" on a coffee cup; [IMAGE2] displays "Notes" on a small journal; [IMAGE3] has "Live Simply" on a white pillow; [IMAGE4] shows "Good Vibes" on a cozy blanket, perfect for lifestyle and home decor branding.”

提示:“这组四幅图像采用简约的手写字体,适合日常使用。[IMAGE1] 咖啡杯上印有“Everyday”;[IMAGE2] 小日记本上印有“Notes”;[IMAGE3] 白色枕头上印有“Live Simply”;[IMAGE4] 舒适毯子上印有“Good Vibes”,非常适合生活方式和家居装饰品牌推广。”

4、家居装饰

每组四张图片均使用 In-Context LoRA 同时生成,旨在保持每组图片的装饰风格一致。

Prompt: “This set of four images captures a colorful, nature-inspired living space with touches of green and earthy textures; [IMAGE1] features a cozy nook with a woven chair draped in green blankets, surrounded by potted plants and botanical prints on the wall; [IMAGE2] highlights a rustic wooden shelf adorned with small planters, candles, and woven baskets; [IMAGE3] displays a serene bedroom with a bed made up in white linens, a natural wood nightstand, and a forest-themed mural; [IMAGE4] shows a close-up of a large plant pot with unique textures beside a patterned area rug.”
提示:“这组四张图片捕捉了一个色彩缤纷、受自然启发的生活空间,带有绿色和泥土质感; [图片 1] 展示了一个舒适的角落,里面有一把铺着绿色毯子的编织椅,四周环绕着盆栽植物,墙上挂着植物图案;[图片 2] 突出了一个质朴的木架,上面装饰着小花盆、蜡烛和编织篮;[图片 3] 展示了一间宁静的卧室,里面有一张铺着白色床单的床、一个天然木床头柜和一幅森林主题的壁画;[图片 4] 展示了一个大花盆的特写,花盆的纹理独特,旁边是一块有图案的地毯。”

5、PowerPoint 模板设计

每组四张图片均与 In-Context LoRA 同时生成,旨在为每组幻灯片创建连贯统一的演示风格。

Prompt: “This set of four images showcases a rustic-themed PowerPoint template for a culinary workshop; [IMAGE1] introduces "Farm to Table Cooking" in warm, earthy tones; [IMAGE2] organizes workshop sections like "Ingredients," "Preparation," and "Serving"; [IMAGE3] displays ingredient lists for seasonal produce; [IMAGE4] includes chef profiles with short bios.”
提示:“这组四张图片展示了一个乡村主题的烹饪工作坊 PowerPoint 模板;[IMAGE1] 以温暖朴实的色调介绍了“从农场到餐桌的烹饪”;[IMAGE2] 组织了“配料”、“准备”和“上菜”等工作坊部分;[IMAGE3] 展示了时令农产品的配料清单;[IMAGE4] 包括厨师简介和简短的简历。”

6、情侣资料生成

每对图像都与 In-Context LoRA 同时生成,旨在在每组中的两张图像中保持一致的风格和身份特征。

Prompt: “This pair of images features a couple as cartoon characters in medieval attire; [IMAGE1] shows a knight with a plumed helmet and a determined look, holding a small shield, while [IMAGE2] displays a character dressed as a princess with a crown, smiling as they hold a flower, both against a castle background.”
提示:“这对图像以一对身着中世纪服装的卡通人物为特色;[IMAGE1] 展示了一位戴着羽毛头盔、目光坚定的骑士,手里拿着一面小盾牌,而 [IMAGE2] 展示了一位身着公主头戴王冠的人物,他们面带微笑,手里拿着一朵花,背景都是城堡。”

7、视觉识别设计

每对图像均与 In-Context LoRA 同时生成,旨在实现每对图像中两幅图像的连贯一致的视觉识别。

Prompt: “The pair of images highlights a logo and its real-world use for a rustic coffee brand; [IMAGE1] a striking teal background showcases a logo with a stylized, perched bird in black and white, titled “Bluebird Roast” in an elegant serif font, with a leafy branch detail underneath; [IMAGE2] this logo is applied to a coffee mug sitting atop a woven coaster on a dark mahogany table, with a blurred background that emphasizes the warm tones and classic aesthetic of the branding in a cozy setting.”
提示:“这对图像突出了一个乡村咖啡品牌的标志及其在现实世界中的用途;[图像 1] 醒目的蓝绿色背景展示了一个黑白风格的栖息鸟标志,标题为“Bluebird Roast”,采用优雅的衬线字体,下方有绿叶树枝细节;[图像 2] 这个标志被应用到一个咖啡杯上,放在深色桃花心木桌上的编织杯垫上,背景模糊,在舒适的环境中强调了品牌的暖色调和经典美感。”

8、肖像插画

每对图像均使用 In-Context LoRA 生成,旨在在“之前”和“之后”的插画版本之间保持一致的身份、服装、表情、相似的姿势和氛围。插画不是直接复制原始照片,而是通过增加表现力来增强关键特征。

Prompt: “This image pair presents a transformation from a realistic portrait to a playful illustration, capturing both detail and artistic flair; [IMAGE1] the photograph shows a woman standing in a bustling marketplace, wearing a wide-brimmed hat, a flowing bohemian dress, and a leather crossbody bag; [IMAGE2] the illustration version exaggerates her accessories and features, with the bohemian dress depicted in vibrant patterns and bold colors, while the background is simplified into abstract market stalls, giving the scene an animated and lively feel.”
提示:“这对图像展示了从现实肖像到俏皮插画的转变,既捕捉了细节又捕捉了艺术气息;[图片 1] 照片显示一名女子站在熙熙攘攘的市场中,戴着宽边帽,穿着飘逸的波西米亚连衣裙,拎着皮革斜挎包;[图片 2] 插画版本夸大了她的配饰和特征,波西米亚连衣裙以鲜艳的图案和大胆的色彩描绘,而背景则简化为抽象的市场摊位,给场景带来生动活泼的感觉。”

9、沙尘暴视觉效果

每个图像对都是使用 In-Context LoRA 生成的,旨在展示“之前”和“之后”沙尘暴效果图像之间的高度一致性。

Prompt: “This image pair showcases the transformation of a cyclist through a sandstorm visual effect; [IMAGE1] features a cyclist in vibrant gear pedaling steadily on a clear, open road with a serene sky in the background, highlighting focus and determination, [IMAGE2] transforms the scene as the cyclist becomes enveloped in a fierce sandstorm, with sand particles swirling intensely around the bike and rider against a stormy, darkened backdrop, emphasizing chaos and power.”
提示:“这对图像通过沙尘暴视觉效果展示了骑自行车的人的转变; [图片 1] 展示了一位身着鲜艳装备的骑行者在晴朗开阔的道路上稳步骑行,背景是宁静的天空,突显出他的专注和决心。[图片 2] 场景发生了变化,骑行者被猛烈的沙尘暴笼罩,沙粒在自行车和骑行者周围剧烈旋转。而不是暴风雨般的黑暗背景,强调混乱和力量。”

10、图像条件生成

使用无需训练的 SDEdit 在多个任务中使用 In-Context LoRA 进行图像条件生成的示例。

肖像身份转移:

字体样式转移:

视觉身份的应用:

肖像到插图:


原文链接:In-Context LoRA for Diffusion Transformers

汇智网翻译整理,转载请标明出处

Tags