A 3d Rendering Engine for ComfyUI #1378
paulh4x
started this conversation in
Built with Kilo
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
🧊 ComfyUI-PHRenderFormerWrapper 🧊
PH's ComfyUI RenderFormer Wrapper is a complete set of custom nodes for tokenbased 3d rendering inside ComfyUI.
ComfyUI_00184_.mp4
Download and Documentation: ComfyUI_PHRenderFormerWrapper GitHub Repository
RenderFormer is a model for neural rendering of 3d geometry based on tokens, it is a new approach in rendering 3d geometry and is best described in this 5 min youtube video: https://www.youtube.com/watch?v=qYJk9l65eJ8&ab_channel=TwoMinutePapers. The model is being presented by the authors at the next siggraph and i see a lot of potential in it. Thats why i tried to contribute something back to the opensource community, without having written any code before.
Everything is very limited and its just my babysteps in coding, but here are the actual features:
🎨 End-to-End Rendering: Load 3D models, define materials, set up cameras, and render—all within ComfyUI.
↔️ Full Transformation Control: Apply translation, rotation, and scaling to any object or light in the scene.
⚙️ Modular Node-Based Workflow: Each step of the rendering pipeline is a separate node, allowing for flexible and complex setups.
🎥 Animation & Video: Create camera and light animations by interpolating between keyframes. The nodes output image batches compatible with ComfyUI's native video-saving nodes.
🔧 Advanced Mesh Processing: Includes nodes for loading, combining, remeshing, and applying simple color randomization to your 3D assets.
💡 Lighting and Material Control: Easily add and combine multiple light sources and control PBR material properties like diffuse, specular, roughness, and emission.
More resources:
ComfyUI
microsoft's RenderFormer GitHub Repository
I am completely new to all of this and was pointed to kilo code, then i installed vs code for it and tried with the free 20$ credtis. The first attempt failed but then i read about the architectmode and created a plan for this:
I added a wrapper for hunyuan3d model by kijai and the example nodepack from filltm as context
I cannot remember the exact prompt for this but it was something like:
From then on it went pretty good, when the plan was adjusted to certain tasks, like making this an independent repo or adding animation functionality to an object. Basically i kept adding functions and nodes in code-mode, used debug mode when i got an error. This was exceptional easy because of debugging and pasting the logs back fixed almost everything. Sometimes i had to point to certain files like the rendering_pipeline.py or batch_infer.py that i thought could be of use to a llm to understand the process and that also helped a lot. Sometimes my context window exceeded by far because i forgot to look out for it and was complete mindblown how far i got with this. All the major steps have been made with the free credits only, but especially closer to thursdays i ran short, in the end i really wanted it to work out and spent 6x15€ during the entire creation, that took mainly place on some of the nights between 06/25 to 07/15, i would estimate the entire amount of time spent with this from installing kilo to publicly open the repo to about 30-40 hours with a lot of adjusting my prompts before sending them.
I would love to push this development further and appreciate any ideas, feedback and support for it.
Thanks for the read,
/PH
Beta Was this translation helpful? Give feedback.
All reactions