[CVPR 2025] EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee1, Hyeongcheol Park1, Jiyoung Seo1, Eunbyung Park2,
Hyunje Park1, Ha Dam Baek1, Sangheon Shin3, Sangmin Kim3, Sangpil Kim1†
1Korea University, 2Yonsei University, 3Hanwha Systems
Tested on Ubuntu 22.04 + CUDA 11.8 + Python 3.9 (RTX A6000 / RTX 3090).
Note: The GPU memory requirement depends on your dataset size.
conda env create -f environment.yaml
conda activate editsplat
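As an optional sanity check (assuming the environment installed PyTorch with CUDA support as specified in environment.yaml), you can confirm that the GPU is visible before running any scripts:

```bash
# Optional sanity check: confirm PyTorch was built with CUDA support and sees a GPU.
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```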
We provide datasets and pretrained weights for all scenes presented in our paper, allowing users to easily reproduce our results and experiment further.
- 📥 Download: Google Drive
After downloading, move the dataset into the cvpr25_EditSplat/dataset/ directory.
If you want to edit your own dataset, you must first pre-train a 3D Gaussian Splatting (3DGS) model from your custom dataset using COLMAP for camera poses.
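A minimal sketch of that pre-training step is shown below, assuming you use the `convert.py` and `train.py` scripts from the official 3D Gaussian Splatting repository (all paths are placeholders; adapt flags to your setup):

```bash
# Hypothetical custom scene; assumes COLMAP and the official 3DGS repository are installed.
# 1) Run COLMAP via the 3DGS convert script to estimate camera poses.
python convert.py -s ./dataset/my_scene

# 2) Pre-train a 3DGS model and write a checkpoint at 30,000 iterations.
#    The resulting chkpnt30000.pth can then be passed to run_editing.py via --source_checkpoint.
python train.py -s ./dataset/my_scene -m ./dataset/pretrained/my_scene --checkpoint_iterations 30000
```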
To run the editing pipeline:
./script/editing_face_to_marble_sculpture.sh
The edited 3D Gaussian Splatting outputs will be saved under `cvpr25_EditSplat/output/`.
You can render custom novel views from the updated 3D scene stored in `cvpr25_EditSplat/output/point_cloud/`.
💻 Command Line Arguments for editing
python run_editing.py -s ./dataset/dataset/face -m output/face_to_marble_sculpture --source_checkpoint ./dataset/pretrained/face/chkpnt30000.pth --object_prompt "face" --target_prompt "Make his face resemble that of a marble sculpture" --sampling_prompt "a photo of a marble sculpture" --target_mask_prompt "face"
- `-s` / `--source_path`: Path to the source directory containing the COLMAP data.
- `--source_checkpoint`: Path to the pretrained 3D Gaussian Splatting (3DGS) checkpoint (`.pth`) you wish to edit. Example: `./dataset/pretrained/<scene_name>/chkpnt30000.pth`
- `-m` / `--model_path`: Path where the edited model should be stored (`output/` by default).
- `--target_prompt`: A text instruction describing the desired edit, written in a format compatible with InstructPix2Pix.
- `--object_prompt`: The object keyword contained in the `target_prompt`. This is used in Attention-Guided Trimming (AGT) to extract cross-attention maps from the diffusion model and assign them to the pretrained 3DGS for local editing and pruning.
- `--sampling_prompt`: A sentence describing the expected result after editing. The ImageReward model uses this prompt to rank the initially edited images and filter out the bottom 15% with the lowest scores before projection.
- `--target_mask_prompt`: An object class name (e.g., "marble sculpture", "wildboar") representing the expected object after editing. Used in Multi-View Fusion Guidance (MFG) to generate a segmentation mask via the SAM model. It does not need to appear in the `target_prompt`. The mask guides background replacement with content from the source dataset.
- `--iterations`: Number of total iterations to edit for, 30,000 by default.
- `--eval`: Add this flag to use a MipNeRF360-style training/test split for evaluation.
Note that, similar to other baselines, we use images with a resolution of 512×512, as required by the InstructPix2Pix model.
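Putting the prompt arguments together, a hypothetical command for editing your own scene (all paths and prompts below are placeholders, not files shipped with the repo) might look like:

```bash
# Hypothetical example: turn a bear in a custom scene into a wildboar.
python run_editing.py \
    -s ./dataset/my_scene \
    -m output/bear_to_wildboar \
    --source_checkpoint ./dataset/pretrained/my_scene/chkpnt30000.pth \
    --object_prompt "bear" \
    --target_prompt "Turn the bear into a wildboar" \
    --sampling_prompt "a photo of a wildboar" \
    --target_mask_prompt "wildboar"
```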
To produce your own edited results, you can often improve quality by tuning the following hyperparameters. Defaults match those used in the main paper.
🛠️ Hyperparameter Details
- Editing epochs: Number of epochs to optimize the edited 3D Gaussian Splatting. Default: 10
- Cross-attention threshold `w_thres`: A higher value leads to tighter localization but may overly restrict the editable region. Default: 0.1
- Pruning proportion `k`: Proportion of Gaussians pruned in the first densification step. A high value may remove too many Gaussians, degrading editing quality. Default: 0.15
- Text guidance weight `s_T`: Weight for the text guidance in the diffusion model. Higher values enforce stronger adherence to the instruction prompt. Default: 7.5
- Multi-view fusion guidance weight `s_M`: Controls the contribution of multi-view information (`h_M`) in the editing. Default: 1.0
- Source guidance weight `s_S`: Weight for the original source image guidance; helps preserve original source information during editing. Default: 0.5
- Filtering ratio: Fraction of initial edited views to filter out (those with the lowest ImageReward scores) before projection in MFG. Default: 0.15
For additional 3DGS-specific hyperparameters such as `feature_lr`, `opacity_lr`, `scaling_lr`, `rotation_lr`, etc., please refer to the official 3D Gaussian Splatting repository.
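If `run_editing.py` exposes these standard 3DGS optimizer arguments in its parser (an assumption worth verifying in the code), they can simply be appended to the editing command, for example:

```bash
# Assumption: the standard 3DGS optimizer flags are forwarded to run_editing.py's argument parser.
python run_editing.py -s ./dataset/dataset/face -m output/face_to_marble_sculpture \
    --source_checkpoint ./dataset/pretrained/face/chkpnt30000.pth \
    --object_prompt "face" --target_prompt "Make his face resemble that of a marble sculpture" \
    --sampling_prompt "a photo of a marble sculpture" --target_mask_prompt "face" \
    --feature_lr 0.0025 --opacity_lr 0.05
```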
We provide several convenient rendering options for visualizing your edited 3D Gaussian Splatting (3DGS) models.
Generate novel view videos and GIF animations:
python render.py --model_path output/face_to_marble_sculpture --iteration 30560 --video
The resulting videos and GIFs are saved under:
output/face_to_marble_sculpture/video/ours_30560/
├── final_video.mp4
└── final_video.gif
If you find our work useful, please consider citing:
@InProceedings{Lee_2025_CVPR,
author = {Lee, Dong In and Park, Hyeongcheol and Seo, Jiyoung and Park, Eunbyung and Park, Hyunje and Baek, Ha Dam and Shin, Sangheon and Kim, Sangmin and Kim, Sangpil},
title = {EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting},
booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
month = {June},
year = {2025},
pages = {11135-11145}
}
Our code is based on these wonderful repos: