I used stable diffusion to create pictures of… things.
Just ChatGPT so far.
I did have Dall-E paint me a picture of “a mouse jumping a motorcycle through a flaming ring made of stone while pursued by vaguely ninja-like evil henchmen characters”
It ended up being this: https://i.imgur.io/ArDk1e1_d.webp?maxwidth=640&shape=thumb&fidelity=medium
Which makes me really, really want this as a video game. Just riding the motorcycle through various environments with ninjas popping out left and right trying to grab you. Sometimes they’ve got nunchucks, sometimes nets, sometimes they swing down on a rope to get you. You get power ups too like little bombs you can throw.
But that’s the only time I used the image generation. Mostly I’ve been having GPT-4 explain history and technology to me.
Just for non-serious things such as short stories that never leave the service and help coming up with names for characters and places in a story I’m writing about a pokemon region, I’ve been using Claude from time to time.
Otherwise I haven’t been doing much with besides one Japanese translation service (Miraitranslate) that claims to use AI for translations, but that’s very far and few between I use their demo thing.
I’ve been using Stable Diffusion (via Automatic1111) for a long time, I’ve become fairly adept at it. Recently Bing’s Dalle-3 has surpassed it in terms of composition and instruction-following, but I still find it really important for doing “finishing” work on Dalle-3’s outputs so I don’t expect to stop using it any time soon.
Lately I’ve been experimenting with Koboldcpp and locally-run large language models. I’ve been coming up with little ideas for scripts and programs that use its API to do stuff.
I’ve been using ChatGPT to find inspiration for greeting cards (for birthday, wedding etc.) for people I don’t know that well.
Stable Diffusion. Making AI generative art has totally edged out my video game addiction. Here’s my civitai profile
You guys aren’t using it for infinite hentai?
That too…
That’s pretty cool! I like the Max Headroom variants. Somehow, I think Mr Headroom in particular would approve of generative AI tech.
I’m just getting into this realm myself. I’m using ComfyUI, with SDXL 1.0 and the new LCM LoRA, but I’m really struggling to get, e.g. consistent framing. (Like, I’ll ask for “full length photo of X” and get nothing but close-up headshots for a dozen images)
Any advice or good resources you recommend?
Either way, very cool work, thanks for sharing!
Frankly, I’ve gotten nothing but shyte from LCM on the initial image. BUT it’s fantastic for upscaling img2img with a denoise of 0.1 and and Ultimate SD Upscale. Not sure how ComfyUI would do it though. I find its UX is too slow on my pc, so I stick to A1111.
But to solve your problem specifically, learn controlNet. By far my most used extension.
And for photorealistic images, the PDF at the link as been a godsend: https://promptgeek.gumroad.com/l/photoreal
YouTube channels: Olivio Sarikas: https://www.youtube.com/@OlivioSarikas Sebastian Kamph: https://www.youtube.com/@sebastiankamph
BTW, you have a civitai profile?
Awesome! Thank you!
I don’t have a civitai profile. I got into this to try to generate profile pics for a home ttrpg session, but I really enjoyed it and I’ve been having a ton of fun learning and trying to create better images.
I appreciate the resources! I know what my evenings are gonna be for awhile! 😁
Define free time.
I use bard mainly as a quick search engine. if it gives me back something useful quick enough fine, if not I do normal web searching.
I find bing’s copilot (or whatever it is called) far better.