Paper/Code Digest, 2022.10.17
Compiled by: AI算法与图像处理 (AI Algorithms and Image Processing)
CVPR 2022 papers and code collection: https://github.com/DWCTOD/CVPR2022-Papers-with-Code-Demo
ECCV 2022 papers and code collection: https://github.com/DWCTOD/ECCV2022-Papers-with-Code-Demo
Latest results demo:
Title: SCAM! Transferring humans between images with Semantic Cross Attention Modulation
Project page: https://imagine.enpc.fr/~dufourn/publications/scam.html
Code: https://github.com/nicolas-dufour/SCAM
Paper: https://arxiv.org/pdf/2210.04883v1.pdf
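A large body of recent work targets semantically conditioned image generation. Most of these methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer, which transfers not only the pose but also the appearance and the background. In this work, we introduce SCAM (Semantic Cross Attention Modulation), a system that encodes rich and diverse information in each semantic region of the image (including foreground and background), which enables precise generation with an emphasis on fine details. This is made possible by the Semantic Attention Transformer Encoder, which extracts multiple latent vectors for each semantic region, and by the corresponding generator, which exploits these latents through semantic cross attention modulation. The model is trained using only a reconstruction setup, while subject transfer is performed at test time. Our analysis shows that the proposed architecture succeeds at encoding the diversity of appearance in each semantic region. Extensive experiments on the iDesigner and CelebAMask-HQ datasets show that SCAM outperforms SEAN and SPADE; moreover, it sets a new state of the art on subject transfer.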
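To make the idea of region-restricted cross attention more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation (see the linked repository for that) and it leaves out the modulation step entirely. It only illustrates one assumption drawn from the abstract: each pixel query attends solely to the latent vectors that belong to its own semantic region. The function name `semantic_cross_attention` and all argument names are hypothetical.

```python
import math


def semantic_cross_attention(pixel_queries, pixel_labels,
                             latent_keys, latent_values, latent_labels):
    """Illustrative only: each pixel attends to the latents of its own region.

    pixel_queries: list of query vectors, one per pixel
    pixel_labels:  semantic label of each pixel
    latent_keys / latent_values: one key and one value vector per latent
    latent_labels: semantic label of each latent (several latents per label)
    """
    outputs = []
    for query, label in zip(pixel_queries, pixel_labels):
        # Keep only the latents that share this pixel's semantic label.
        idx = [j for j, l in enumerate(latent_labels) if l == label]
        if not idx:
            raise ValueError(f"no latent vector for semantic label {label!r}")

        # Scaled dot-product scores against the selected keys.
        scale = math.sqrt(len(query))
        scores = [
            sum(q * k for q, k in zip(query, latent_keys[j])) / scale
            for j in idx
        ]

        # Numerically stable softmax over the selected latents.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]

        # Weighted sum of the selected value vectors.
        dim = len(latent_values[idx[0]])
        outputs.append([
            sum(w * latent_values[j][d] for w, j in zip(weights, idx))
            for d in range(dim)
        ])
    return outputs


# Toy usage: two pixels, two regions; region 0 has two latents, region 1 has one.
features = semantic_cross_attention(
    pixel_queries=[[1.0, 0.0], [0.0, 1.0]],
    pixel_labels=[0, 1],
    latent_keys=[[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    latent_values=[[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]],
    latent_labels=[0, 0, 1],
)
```

Restricting attention by label is what lets a region hold several latents at once: the softmax decides, per pixel, which of that region's latents matters most, so one region can represent more than one appearance.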
Latest papers
ECCV2022
Updated on : 17 Oct 2022
total number : 4
Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization
- Paper: http://arxiv.org/pdf/2210.07764
- Code: None
The Surprisingly Straightforward Scene Text Removal Method With Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis
- Paper: http://arxiv.org/pdf/2210.07489
- Code: https://github.com/naver/garnet
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction
- Paper: http://arxiv.org/pdf/2210.07424
- Code: None
Task Grouping for Multilingual Text Recognition
- Paper: http://arxiv.org/pdf/2210.07423
- Code: None
CVPR2022
NeurIPS
Updated on : 17 Oct 2022
total number : 13
Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks
- Paper: http://arxiv.org/pdf/2210.08001
- Code: None
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments
- Paper: http://arxiv.org/pdf/2210.07940
- Code: None
MOVE: Unsupervised Movable Object Segmentation and Detection
- Paper: http://arxiv.org/pdf/2210.07920
- Code: None
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
- Paper: http://arxiv.org/pdf/2210.07883
- Code: https://github.com/KumapowerLIU/FFCLIP
Model-Based Imitation Learning for Urban Driving
- Paper: http://arxiv.org/pdf/2210.07729
- Code: https://github.com/wayveai/mile
Quo Vadis: Is Trajectory Forecasting the Key Towards Long-Term Multi-Object Tracking?
- Paper: http://arxiv.org/pdf/2210.07681
- Code: https://github.com/dendorferpatrick/QuoVadis
DART: Articulated Hand Model with Diverse Accessories and Rich Textures
- Paper: http://arxiv.org/pdf/2210.07650
- Code: None
Mix and Reason: Reasoning over Semantic Topology with Data Mixing for Domain Generalization
- Paper: http://arxiv.org/pdf/2210.07571
- Code: None
TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers
- Paper: http://arxiv.org/pdf/2210.07562
- Code: https://github.com/mlvlab/tokenmixup
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
- Paper: http://arxiv.org/pdf/2210.07506
- Code: https://github.com/peihaochen/ws-mgmap
Learning Active Camera for Multi-Object Navigation
- Paper: http://arxiv.org/pdf/2210.07505
- Code: None
Evaluating Out-of-Distribution Performance on Document Image Classifiers
- Paper: http://arxiv.org/pdf/2210.07448
- Code: None
A Consistent and Differentiable Lp Canonical Calibration Error Estimator
- Paper: http://arxiv.org/pdf/2210.07810
- Code: https://github.com/tpopordanoska/ece-kde