Search Results - "semantics"

  1.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXIX by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 757 pages) : illustrations (chiefly color).
    Contents: “…Box-Supervised Instance Segmentation with Level Set Evolution -- Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding -- Adaptive Agent Transformer for Few-Shot Segmentation -- Waymo Open Dataset: Panoramic Video Panoptic Segmentation -- TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation -- AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions -- Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation -- Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications -- Perceptual Artifacts Localization for Inpainting -- 2D Amodal Instance Segmentation Guided by 3D Shape Prior -- Data Efficient 3D Learner via Knowledge Transferred from 2D Model -- Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation -- Dense Gaussian Processes for Few-Shot Segmentation -- 3D Instances as 1D Kernels -- TransMatting: Enhancing Transparent Objects Matting with Transformers -- MVSalNet: Multi-View Augmentation for RGB-D Salient Object Detection -- k-Means Mask Transformer -- SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness -- Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation -- Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment -- Interclass Prototype Relation for Few-Shot Segmentation -- Slim Scissors: Segmenting Thin Object from Synthetic Background -- Abstracting Sketches through Simple Primitives -- Multi-Scale and Cross-Scale Contrastive Learning for Semantic Segmentation -- One-Trimap Video Matting -- D2ADA: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation -- Learning Quality-Aware Dynamic Memory for Video Object Segmentation -- Learning Implicit Feature Alignment Function for Semantic Segmentation -- Quantum Motion Segmentation -- Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation -- Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation -- Geodesic-Former: A Geodesic-Guided Few-Shot 3D Point Cloud Instance Segmenter -- Union-Set Multi-source Model Adaptation for Semantic Segmentation -- Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions -- BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation -- SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection -- Global Spectral Filter Memory Network for Video Object Segmentation -- Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer -- RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation -- Learning Topological Interactions for Multi-Class Medical Image Segmentation -- Unsupervised Segmentation in Real-World Images via Spelke Object Inference -- A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model.…”
    Conference Proceeding eBook
  2.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXVIII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 751 pages) : illustrations (chiefly color).
    Contents: “…Salient Object Detection for Point Clouds -- Learning Semantic Segmentation from Multiple Datasets with Label Shifts -- Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination -- Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning -- Variance-Aware Weight Initialization for Point Convolutional Neural Networks -- Break and Make: Interactive Structural Understanding Using LEGO Bricks -- Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation -- 3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching -- Video Restoration Framework and Its Meta-Adaptations to Data-Poor Conditions -- MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud -- Scene Text Recognition with Permuted Autoregressive Sequence Models -- When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition -- Detecting Tampered Scene Text in the Wild -- Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning -- GLASS: Global to Local Attention for Scene-Text Spotting -- COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts -- Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting -- Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition -- Levenshtein OCR -- Multi-Granularity Prediction for Scene Text Recognition -- Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting -- Contextual Text Block Detection towards Scene Text Understanding -- CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition -- Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context -- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers -- Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features -- SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition -- Pure Transformer with Integrated Experts for Scene Text Recognition -- OCR-Free Document Understanding Transformer -- CAR: Class-Aware Regularizations for Semantic Segmentation -- Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation -- SeqFormer: Sequential Transformer for Video Instance Segmentation -- Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection -- In Defense of Online Models for Video Instance Segmentation -- Active Pointly-Supervised Instance Segmentation -- A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining -- XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model -- Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving -- 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds -- Extract Free Dense Labels from CLIP -- 3D Compositional Zero-Shot Learning with DeCompositional Consensus -- Video Mask Transfiner for High-Quality Video Instance Segmentation.…”
    Conference Proceeding eBook
  3.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXX by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Fast Two-View Motion Segmentation Using Christoffel Polynomials -- UCTNet: Uncertainty-Aware Cross-Modal Transformer Network for Indoor RGB-D Semantic Segmentation -- Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation -- Learning Regional Purity for Instance Segmentation on 3D Point Clouds -- Cross-Domain Few-Shot Semantic Segmentation -- Generative Subgraph Contrast for Self-Supervised Graph Representation Learning -- SdAE: Self-Distillated Masked Autoencoder -- Demystifying Unsupervised Semantic Correspondence Estimation -- Open-Set Semi-Supervised Object Detection -- Vibration-Based Uncertainty Estimation for Learning from Limited Supervision -- Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation -- Weakly Supervised Object Localization through Inter-class Feature Similarity and Intra-Class Appearance Consistency -- Active Learning Strategies for Weakly-Supervised Object Detection -- Mc-BEiT: Multi-Choice Discretization for Image BERT Pre-training -- Bootstrapped Masked Autoencoders for Vision BERT Pretraining -- Unsupervised Visual Representation Learning by Synchronous Momentum Grouping -- Improving Few-Shot Part Segmentation Using Coarse Supervision -- What to Hide from Your Students: Attention-Guided Masked Image Modeling -- Pointly-Supervised Panoptic Segmentation -- MVP: Multimodality-Guided Visual Pre-training -- Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection -- HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation -- SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation -- Dual-Domain Self-Supervised Learning and Model Adaption for Deep Compressive Imaging -- Unsupervised Selective Labeling for More Effective Semi-Supervised Learning -- Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation -- Dense Siamese Network for Dense Unsupervised Learning -- Multi-Granularity Distillation Scheme towards Lightweight Semi-Supervised Semantic Segmentation -- CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation -- Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization -- RDA: Reciprocal Distribution Alignment for Robust Semi-Supervised Learning -- MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation -- United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning -- Synergistic Self-Supervised and Quantization Learning -- Semi-Supervised Vision Transformers -- Domain Adaptive Video Segmentation via Temporal Pseudo Supervision -- Diverse Learner: Exploring Diverse Supervision for Semi-Supervised Object Detection -- A Closer Look at Invariances in Self-Supervised Pre-training for 3D Vision -- ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization -- FedX: Unsupervised Federated Learning with Cross Knowledge Distillation -- W2N: Switching from Weak Supervision to Noisy Supervision for Object Detection -- Decoupled Adversarial Contrastive Learning for Self-Supervised Adversarial Robustness.…”
    Conference Proceeding eBook
  4.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXIV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 763 pages) : illustrations (chiefly color).
    Contents: “…Interpretable Open-Set Domain Adaptation via Angular Margin Separation -- TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation -- Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation -- RBC: Rectifying the Biased Context in Continual Semantic Segmentation -- Factorizing Knowledge in Neural Networks -- Contrastive Vicinal Space for Unsupervised Domain Adaptation -- Cross-Modal Knowledge Transfer without Task-Relevant Source Data -- Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions -- Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition -- BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation -- Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks -- Incomplete Multi-View Domain Adaptation via Channel Enhancement and Knowledge Transfer -- DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization -- ML-BPM: Multi-Teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation -- PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks -- Personalized Education: Blind Knowledge Distillation -- Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space -- How Stable Are Transferability Metrics Evaluations? …”
    Conference Proceeding eBook
  5.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 753 pages) : illustrations (chiefly color).
    Contents: “…Most and Least Retrievable Images in Visual-Language Query Systems -- Sports Video Analysis on Large-Scale Data -- Grounding Visual Representations with Texts for Domain Generalization -- Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions -- StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation -- VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance -- Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation -- End-to-End Active Speaker Detection -- Emotion Recognition for Multiple Context Awareness -- Adaptive Fine-Grained Sketch-Based Image Retrieval -- Quantized GAN for Complex Music Generation from Dance Videos -- Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction -- Localizing Visual Sounds the Easy Way -- Learning Visual Styles from Audio-Visual Associations -- Remote Respiration Monitoring of Moving Person Using Radio Signals -- Camera Pose Estimation and Localization with Active Audio Sensing -- PACS: A Dataset for Physical Audiovisual Commonsense Reasoning -- VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer -- Telepresence Video Quality Assessment -- MultiMAE: Multi-modal Multi-task Masked Autoencoders -- AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation -- Audio-Visual Segmentation -- Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression -- Relationformer: A Unified Framework for Image-to-Graph Generation -- GAMa: Cross-view Video Geo-localization -- Revisiting a kNN-based Image Classification System with High-capacity Storage -- Geometric Representation Learning for Document Image Rectification -- S2-VER: Semi-Supervised Visual Emotion Recognition -- Image Coding for Machines with Omnipotent Feature Learning -- Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval -- Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition -- Semantic-Guided Multi-Mask Image Harmonization -- Learning an Isometric Surface Parameterization for Texture Unwrapping -- Towards Regression-Free Neural Networks for Diverse Compute Platforms -- Relationship Spatialization for Depth Estimation -- Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models -- FAR: Fourier Aerial Video Recognition -- Translating a Visual LEGO Manual to a Machine-Executable Plan -- Fabric Material Recovery from Video Using Multi-Scale Geometric Auto-Encoder -- MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment -- The One Where They Reconstructed 3D Humans and Environments in TV Shows.…”
    Conference Proceeding eBook
  6.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXVII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (1 volume) : illustrations (black and white).
    Contents: “…Relative Contrastive Loss for Unsupervised Representation Learning -- Fine-Grained Fashion Representation Learning by Online Deep Clustering -- NashAE: Disentangling Representations through Adversarial Covariance Minimization -- A Gyrovector Space Approach for Symmetric Positive Semi-Definite Matrix Learning -- Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training -- Contrasting Quadratic Assignments for Set-Based Representation Learning -- Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer -- Object Discovery and Representation Networks -- Trading Positional Complexity vs Deepness in Coordinate Networks -- MVDG: A Unified Multi-View Framework for Domain Generalization -- Panoptic Scene Graph Generation -- Object-Compositional Neural Implicit Surfaces -- RigNet: Repetitive Image Guided Network for Depth Completion -- FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling -- LiDAL: Inter-Frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation -- Hierarchical Memory Learning for Fine-Grained Scene Graph Generation -- DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation -- MTFormer: Multi-task Learning via Transformer and Cross Task Reasoning -- MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images -- TO-Scene: A Large-Scale Dataset for Understanding 3D Tabletop Scenes -- Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? …”
    Conference Proceeding eBook
  7.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 759 pages) : illustrations (chiefly color).
    Contents: “…Cross-Domain Ensemble Distillation for Domain Generalization -- Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels -- Hyperspherical Learning in Multi-Label Classification -- When Active Learning Meets Implicit Semantic Data Augmentation -- VL-LTR: Learning Class-Wise Visual-Linguistic Representation for Long-Tailed Visual Recognition -- Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-of-Distribution Generalization -- Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection -- Tracking by Associating Clips -- RealPatch: A Statistical Matching Framework for Model Patching with Real Samples -- Background-Insensitive Scene Text Recognition with Text Semantic Segmentation -- Semantic Novelty Detection via Relational Reasoning -- Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers -- Training Vision Transformers with Only 2040 Images -- Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection -- TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs -- Automatic Check-Out via Prototype-Based Classifier Learning from Single-Product Exemplars -- Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain -- Photo-Realistic Neural Domain Randomization -- Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning -- Tailoring Self-Supervision for Supervised Learning -- Difficulty-Aware Simulator for Open Set Recognition -- Few-Shot Class-Incremental Learning from an Open-Set Perspective -- FOSTER: Feature Boosting and Compression for Class-Incremental Learning -- Visual Knowledge Tracing -- S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning -- Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism -- VSA: Learning Varied-Size Window Attention in Vision Transformers -- Unbiased Manifold Augmentation for Coarse Class Subdivision -- DenseHybrid: Hybrid Anomaly Detection for Dense Open-Set Recognition -- Rethinking Confidence Calibration for Failure Prediction -- Uncertainty-Guided Source-Free Domain Adaptation -- Should All Proposals Be Treated Equally in Object Detection? …”
    Conference Proceeding eBook
  8.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XIV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 751 pages) : illustrations (chiefly color).
    Contents: “…Transferring Humans between Images with Semantic Cross Attention Modulation -- Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields.…”
    Conference Proceeding eBook
  9.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXIII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…SimpleRecon: 3D Reconstruction without 3D Convolutions -- Structure and Motion from Casual Videos -- What Matters for 3D Scene Flow Network -- Correspondence Reweighted Translation Averaging -- Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images -- GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs -- Objects Can Move: 3D Change Detection by Geometric Transformation Consistency -- Language-Grounded Indoor 3D Semantic Segmentation in the Wild -- Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs -- Deforming Radiance Fields with Cages -- FLEX: Extrinsic Parameters-Free Multi-View 3D Human Motion Reconstruction -- MODE: Multi-View Omnidirectional Depth Estimation with 360 Cameras -- GigaDepth: Learning Depth from Structured Light with Branching Neural Networks -- ActiveNeRF: Learning Where to See with Uncertainty Estimation -- PoserNet: Refining Relative Camera Poses Exploiting Object Detections -- Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction & Pose Estimation -- Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling -- Towards Learning Neural Representations from Shadows -- Class-Incremental Novel Class Discovery -- Unknown-Oriented Learning for Open Set Domain Adaptation -- Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation -- DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation -- Class-Agnostic Object Counting Robust to Intraclass Diversity -- Burn after Reading: Online Adaptation for Cross-Domain Streaming Data -- Mind the Gap in Distilling StyleGANs -- Improving Test-Time Adaptation via Shift-Agnostic Weight Regularization and Nearest Source Prototypes -- Learning Instance-Specific Adaptation for Cross-Domain Segmentation -- RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning -- Long-Tailed Class Incremental Learning -- DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning -- Adversarial Partial Domain Adaptation by Cycle Inconsistency -- Combating Label Distribution Shift for Active Domain Adaptation -- GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation -- CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation -- A Unified Framework for Domain Adaptive Pose Estimation -- A Broad Study of Pre-training for Domain Generalization and Adaptation -- Prior Knowledge Guided Unsupervised Domain Adaptation -- GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization -- AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection -- Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box -- Visual Prompt Tuning -- Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap.…”
    Conference Proceeding eBook
  10.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXI by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 753 pages) : illustrations (chiefly color).
    Contents: “…GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning -- Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning -- Revisiting the Critical Factors of Augmentation-Invariant Representation Learning -- CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation -- Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation -- Semantic-Aware Fine-Grained Correspondence -- Self-Supervised Classification Network -- Data Invariants to Understand Unsupervised Out-of-Distribution Detection -- Domain Invariant Masked Autoencoders for Self-Supervised Learning from Multi-Domains -- Semi-Supervised Object Detection via Virtual Category Learning -- Completely Self-Supervised Crowd Counting via Distribution Matching -- Coarse-to-Fine Incremental Few-Shot Learning -- Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling -- Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition -- CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation -- PSS: Progressive Sample Selection for Open-World Visual Representation Learning -- Improving Self-Supervised Lightweight Model Learning via Hard-Aware Metric Distillation -- Object Discovery via Contrastive Learning for Weakly Supervised Object Detection -- Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers -- DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model -- Semi-Leak: Membership Inference Attacks against Semi-Supervised Learning -- OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning -- Embedding Contrastive Unsupervised Features to Cluster in- and Out-of-Distribution Noise in Corrupted Image Datasets -- Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space -- Towards Realistic Semi-Supervised Learning -- Masked Siamese Networks for Label-Efficient Learning -- Natural Synthetic Anomalies for Self-Supervised Anomaly Detection and Localization -- Understanding Collapse in Non-Contrastive Siamese Representation Learning -- Federated Self-Supervised Learning for Video Understanding -- Towards Efficient and Effective Self-Supervised Learning of Visual Representations -- DSR - A Dual Subspace Re-Projection Network for Surface Anomaly Detection -- PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds -- MVSTER: Epipolar Transformer for Efficient Multi-View Stereo -- RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild -- R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis -- KD-MVS: Knowledge Distillation Based Self-Supervised Learning for Multi-View Stereo -- SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas -- RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering -- Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes -- NeILF: Neural Incident Light Field for Physically-Based Material Estimation -- ARF: Artistic Radiance Fields -- Multiview Stereo with Cascaded Epipolar RAFT.…”
    Conference Proceeding eBook
  11.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVIII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 763 pages) : illustrations (chiefly color).
    Contents: “…Transformer-Based Geo-Localization in the Wild -- Colorization for In Situ Marine Plankton Images -- Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection -- A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch -- A Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D -- AutoTransition: Learning to Recommend Video Transition Effects -- Online Segmentation of LiDAR Sequences: Dataset and Algorithm -- Open-World Semantic Segmentation for LIDAR Point Clouds -- KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients -- Differentiable Raycasting for Self-Supervised Occupancy Forecasting -- InAction: Interpretable Action Decision Making for Autonomous Driving -- CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection -- CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving -- Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving -- StretchBEV: Stretching Future Instance Prediction Spatially and Temporally -- RCLane: Relay Chain Prediction for Lane Detection -- Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation -- CenterFormer: Center-based Transformer for 3D Object Detection -- Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches -- ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning -- PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark -- PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation -- BRNet: Exploring Comprehensive Features for Monocular Depth Estimation -- SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network -- Context-Aware Streaming Perception in Dynamic Environments -- Multimodal Transformer for Automatic 3D Annotation and Object Detection -- Dynamic 3D Scene Analysis by Point Cloud Accumulation -- Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection -- JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes -- Semi-Supervised 3D Object Detection with Proficient Teachers -- Point Cloud Compression with Sibling Context and Surface Priors.…”
    Conference Proceeding eBook
  12.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning -- 3D-Aware Semantic-Guided Generative Model for Human Synthesis -- Temporally Consistent Semantic Video Editing -- Error Compensation Framework for Flow-Guided Video Inpainting -- Scraping Textures from Natural Images for Synthesis and Editing -- Single Stage Virtual Try-On via Deformable Attention Flows -- Improving GANs for Long-Tailed Data through Group Spectral Regularization -- Hierarchical Semantic Regularization of Latent Spaces in StyleGANs -- IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion -- StyleLight: HDR Panorama Generation for Lighting Estimation and Editing -- Contrastive Monotonic Pixel-Level Modulation -- Learning Cross-Video Neural Representations for High-Quality Frame Interpolation -- Learning Continuous Implicit Representation for Near-Periodic Patterns -- End-to-End Graph-Constrained Vectorized Floorplan Generation with Panoptic Refinement -- Few-Shot Image Generation with Mixup-Based Distance Learning -- A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos -- FakeCLR. …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  13. 13

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVI by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (1 volume) : illustrations (black and white).
    Contents: “…Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing -- Generative Negative Text Replay for Continual Vision-Language Pretraining -- Video Graph Transformer for Video Question Answering -- Trace Controlled Text to Image Generation -- Video Question Answering with Iterative Video-Text Co-Tokenization -- Rethinking Data Augmentation for Robust Visual Question Answering -- Explicit Image Caption Editing -- Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding -- Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly -- GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features -- Selective Query-Guided Debiasing for Video Corpus Moment Retrieval -- Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding -- Object-Centric Unsupervised Image Captioning -- Contrastive Vision-Language Pre-training with Limited Resources -- Learning Linguistic Association towards Efficient Text-Video Retrieval -- ASSISTER: Assistive Navigation via Conditional Instruction Generation -- X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks -- Learning Disentanglement with Decoupled Labels for Vision-Language Navigation -- Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input -- Word-Level Fine-Grained Story Visualization -- Unifying Event Detection and Captioning as Sequence Generation via Pre-training -- Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation -- Fine-Grained Visual Entailment -- Bottom Up Top down Detection Transformers for Language Grounding in Images and Point Clouds -- New Datasets and Models for Contextual Reasoning in Visual Dialog -- VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection -- Classification-Regression for Chart Comprehension -- AssistQ: 
Affordance-Centric Question-Driven Task Completion for Egocentric Assistant -- FindIt: Generalized Localization with Natural Language Queries -- UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling -- Scaling Open-Vocabulary Image Segmentation with Image-Level Labels -- The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning -- Speaker-Adaptive Lip Reading with User-Dependent Padding -- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation -- SemAug: Semantically Meaningful Image Augmentations for Object Detection through Language Grounding -- Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance -- NewsStories: Illustrating Articles with Visual Summaries -- Webly Supervised Concept Expansion for General Purpose Vision Models -- FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation -- CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval -- Language-Driven Artistic Style Transfer -- Single-Stream Multi-level Alignment for Vision-Language Pretraining.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  14. 14

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 757 pages) : illustrations (chiefly color).
    Contents: “…Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization -- BASQ: Branch-Wise Activation-Clipping Search Quantization for Sub-4-Bit Neural Networks -- You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding -- Real Spike: Learning Real-Valued Spikes for Spiking Neural Networks -- FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks -- Theoretical Understanding of the Information Flow on Continual Learning Performance -- Exploring Lottery Ticket Hypothesis in Spiking Neural Networks -- On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network -- LANA: Latency Aware Network Acceleration -- RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization -- U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search -- PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization -- Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach -- Understanding the Dynamics of DNNs Using Graph Modularity -- Latent Discriminant Deterministic Uncertainty -- Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals -- HIVE: Evaluating the Human Interpretability of Visual Explanations -- BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks -- SESS: Saliency Enhancing with Scaling and Sliding -- No Token Left Behind: Explainability-Aided Image Classification and Generation -- Interpretable Image Classification with Differentiable Prototypes Assignment -- Contributions of Shape, Texture, and Color in Visual Recognition -- STEEX: Steering Counterfactual Explanations with Semantics -- Are Vision Transformers Robust to Patch Perturbations? …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  15. 15

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency -- Leveraging Action Affinity and Continuity for Semi-Supervised Temporal Action Segmentation -- Spotting Temporally Precise, Fine-Grained Events in Video -- Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation -- Efficient Video Transformers with Spatial-Temporal Token Selection -- Long Movie Clip Classification with State-Space Video Models -- Prompting Visual-Language Models for Efficient Video Understanding -- Asymmetric Relation Consistency Reasoning for Video Relation Grounding -- Self-Supervised Social Relation Representation for Human Group Detection -- K-Centered Patch Sampling for Efficient Video Recognition -- A Deep Moving-Camera Background Model -- GraphVid: It Only Takes a Few Nodes to Understand a Video -- Delta Distillation for Efficient Video Processing -- MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning -- COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality -- E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context -- TDViT: Temporal Dilated Video Transformer for Dense Video Tasks -- Semi-Supervised Learning of Optical Flow by Flow Supervisor -- Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization -- Deep 360 Optical Flow Estimation Based on Multi-Projection Fusion -- MaCLR: Motion-Aware Contrastive Learning of Representations for Videos -- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection -- Frozen CLIP Models Are Efficient Video Learners -- PIP: Physical Interaction Prediction via Mental Simulation with Span Selection -- Panoramic Vision Transformer for Saliency Detection in 360 Videos -- Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration -- Motion Sensitive Contrastive Learning for Self-Supervised Video Representation -- 
Dynamic Temporal Filtering In Video Models -- Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification -- Temporal Lift Pooling for Continuous Sign Language Recognition -- MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes -- SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding -- Cross-Modal Prototype Driven Network for Radiology Report Generation -- TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts -- SeqTR: A Simple Yet Universal Network for Visual Grounding -- VTC: Improving Video-Text Retrieval with User Comments -- FashionViL: Fashion-Focused Vision-and-Language Representation Learning -- Weakly Supervised Grounding for VQA in Vision-Language Transformers -- Automatic Dense Annotation of Large-Vocabulary Sign Language Videos -- MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval -- GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval -- A Simple and Robust Correlation Filtering Method for Text-Based Person Search.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  16. 16

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XVII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 743 pages) : illustrations (chiefly color).
    Contents: “…Editing Out-of-Domain GAN Inversion via Differential Activations -- On the Robustness of Quality Measures for GANs -- Sound-Guided Semantic Video Generation -- Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation -- Controllable Video Generation through Global and Local Motion Dynamics -- StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN -- Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer -- Combining Internal and External Constraints for Unrolling Shutter in Videos -- WISE: Whitebox Image Stylization by Example-Based Learning -- Neural Radiance Transfer Fields for Relightable Novel-View Synthesis with Global Illumination -- Transformers As Meta-Learners for Implicit Neural Representations -- Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment -- High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions -- A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution -- Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis -- AdaNeRF: Adaptive Sampling for Real-Time Rendering of Neural Radiance Fields -- Improving the Perceptual Quality of 2D Animation Interpolation -- Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region Mask -- Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution -- GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints -- DoodleFormer: Creative Sketch Drawing with Transformers -- Implicit Neural Representations for Variable Length Human Motion Generation -- Learning Object Placement via Dual-Path Graph Completion -- Expanded Adaptive Scaling Normalization for End to End Image Compression -- Generator Knows What Discriminator Should Learn in Unconditional GANs -- Compositional Visual Generation with Composable Diffusion 
Models -- ManiFest: Manifold Deformation for Few-Shot Image Translation -- Supervised Attribute Information Removal and Reconstruction for Image Manipulation -- BLT: Bidirectional Layout Transformer for Controllable Layout Generation -- Diverse Generation from a Single Video Made Possible -- Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features -- Bridging the Domain Gap towards Generalization in Automatic Colorization -- Generating Natural Images with Direct Patch Distributions Matching -- Context-Consistent Semantic Image Editing with Style-Preserved Modulation -- Eliminating Gradient Conflict in Reference-Based Line-Art Colorization -- Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations -- JPEG Artifacts Removal via Contrastive Representation Learning -- Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning -- Efficient Long-Range Attention Network for Image Super-Resolution -- FlowFormer: A Transformer Architecture for Optical Flow -- Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction -- Learning Shadow Correspondence for Video Shadow Detection -- Metric Learning Based Interactive Modulation for Real-World Super-Resolution.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  17. 17

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XX by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 759 pages) : illustrations (chiefly color).
    Contents: “…tSF: Transformer-Based Semantic Filter for Few-Shot Learning -- Adversarial Feature Augmentation for Cross-Domain Few-Shot Classification -- Constructing Balance from Imbalance for Long-Tailed Image Recognition -- On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond -- Few-Shot Video Object Detection -- Worst Case Matters for Few-Shot Recognition -- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification -- Doubly Deformable Aggregation of Covariance Matrices for Few-Shot Segmentation -- Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation -- Rethinking Clustering-Based Pseudo Labeling for Unsupervised Meta-Learning -- CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition -- Few-Shot Class-Incremental Learning for 3D Point Cloud Objects -- Meta-Learning with Less Forgetting on Large-Scale Non-stationary Task Distributions -- DNA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment -- Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning -- Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding -- Few-Shot Classification with Contrastive Learning -- Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection -- Self-Promoted Supervision for Few-Shot Transformer -- Few-Shot Object Counting and Detection -- Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark -- Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations -- Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection -- Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-Shot Medical Image Segmentation -- Improving Few-Shot Learning through Multi-task Representation Learning Theory -- Tree Structure-Aware Few Shot Image Classification via Hierarchical 
Aggregation -- Inductive and Transductive Few Shot Video Classification via Appearance and Temporal Alignments -- Temporal and Cross-Modal Attention for Audio-Visual Zero-Shot Learning -- HM: Hybrid Masking for Few-Shot Segmentation -- TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning -- Kernel Relative-Prototype Spectral Filtering for Few-Shot Learning -- "This Is My Unicorn, Fluffy" : Personalizing Frozen Vision-Language Representations -- CLOSE: Curriculum Learning on the Sharing Extent towards Better One-Shot NAS -- Streamable Neural Fields -- Gradient-Based Uncertainty for Monocular Depth Estimation -- Online Continual Learning with Contrastive Vision Transformer -- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution -- EAutoDet: Efficient Architecture Search for Object Detection -- A Max-Flow Based Approach for Neural Architecture Search -- OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses -- ERA: Enhanced Rational Activations -- Convolutional Embedding Makes Hierarchical Vision Transformer Stronger.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  18. 18

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part VI by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 749 pages) : illustrations (chiefly color).
    Contents: “…UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture -- Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction -- Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation -- VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data -- Poseur: Direct Human Pose Regression with Transformers -- SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation -- Regularizing Vector Embedding in Bottom-Up Human Pose Estimation -- A Visual Navigation Perspective for Category-Level Object Pose Estimation -- Faster VoxelPose: Real-Time 3D Human Pose Estimation by Orthographic Projection -- Learning to Fit Morphable Models -- EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices -- GraspD: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands -- AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling -- Deep Radial Embedding for Visual Sequence Learning -- SAGA: Stochastic Whole-Body Grasping with Contact -- Neural Capture of Animatable 3D Human from Monocular Video -- General Object Pose Transformation Network from Unpaired Data -- Compositional Human-Scene Interaction Synthesis with Semantic Control -- PressureVision: Estimating Hand Pressure from a Single RGB Image -- PoseScript: 3D Human Poses from Natural Language -- DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation -- 3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal -- Pose for Everything: Towards Category-Agnostic Pose Estimation -- PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting -- DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation -- Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation -- Boosting Event Stream Super-Resolution with a Recurrent Neural Network -- Projective Parallel 
Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning -- Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization -- Practical and Scalable Desktop-Based High-Quality Facial Capture -- FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling -- Physically-Based Editing of Indoor Scene Lighting from a Single Image -- LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark -- MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects -- Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset -- Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild -- Learning Deep Non-Blind Image Deconvolution without Ground Truths -- NEST: Neural Event Stack for Event-Based Image Enhancement -- Editable Indoor Lighting Estimation -- Fast Two-Step Blind Optical Aberration Correction -- Seeing Far in the Dark with Patterned Flash -- PseudoClick: Interactive Image Segmentation with Click Imitation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  19. 19

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part VII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 743 pages) : illustrations (chiefly color).
    Contents: “…UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture -- Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction -- Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation -- VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data -- Poseur: Direct Human Pose Regression with Transformers -- SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation -- Regularizing Vector Embedding in Bottom-Up Human Pose Estimation -- A Visual Navigation Perspective for Category-Level Object Pose Estimation -- Faster VoxelPose: Real-Time 3D Human Pose Estimation by Orthographic Projection -- Learning to Fit Morphable Models -- EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices -- GraspD: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands -- AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling -- Deep Radial Embedding for Visual Sequence Learning -- SAGA: Stochastic Whole-Body Grasping with Contact -- Neural Capture of Animatable 3D Human from Monocular Video -- General Object Pose Transformation Network from Unpaired Data -- Compositional Human-Scene Interaction Synthesis with Semantic Control -- PressureVision: Estimating Hand Pressure from a Single RGB Image -- PoseScript: 3D Human Poses from Natural Language -- 3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal -- Pose for Everything: Towards Category-Agnostic Pose Estimation -- PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting -- DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation -- Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation -- Boosting Event Stream Super-Resolution with a Recurrent Neural Network -- Projective Parallel Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light 
Scanning -- Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization -- Practical and Scalable Desktop-Based High-Quality Facial Capture -- FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling -- Physically-Based Editing of Indoor Scene Lighting from a Single Image -- LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark -- MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects -- Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset -- Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild -- Learning Deep Non-Blind Image Deconvolution without Ground Truths -- NEST: Neural Event Stack for Event-Based Image Enhancement -- Editable Indoor Lighting Estimation -- Fast Two-Step Blind Optical Aberration Correction -- Seeing Far in the Dark with Patterned Flash -- PseudoClick: Interactive Image Segmentation with Click Imitation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  20. 20

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part I by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 747 pages) : illustrations (chiefly color).
    Contents: “…Learning Depth from Focus in the Wild -- Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World -- An End-to-End Transformer Model for Crowd Localization -- Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network -- DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection -- Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation -- Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects -- Lidar Point Cloud Guided Monocular 3D Object Detection -- Structural Causal 3D Reconstruction -- 3D Human Pose Estimation Using Möbius Graph Convolutional Networks -- Learning to Train a Point Cloud Reconstruction Network without Matching -- PanoFormer: Panorama Transformer for Indoor 360 Depth Estimation -- Self-supervised Human Mesh Recovery with Cross-Representation Alignment -- AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction -- A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation -- PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo -- Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency -- Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation -- AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture -- Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers -- GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping -- Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion -- GitNet: Geometric Prior-Based Transformation for Birds-Eye View Segmentation -- Learning Visibility for Robust Dense Human Body Estimation -- Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes -- CompNVS: Novel View Synthesis with Scene Completion -- SketchSampler: Sketch-Based 3D Reconstruction via 
View-Dependent Depth Sampling -- LocalBins: Improving Depth Estimation by Learning Local Distributions -- 2D GANs Meet Unsupervised Single-View 3D Reconstruction -- InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images -- Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors -- Bilateral Normal Integration -- S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning -- SC-wLS: Towards Interpretable Feed-Forward Camera Re-localization -- FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras -- DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image -- 3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform -- RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation -- Monocular 3D Object Reconstruction with GAN Inversion -- Map-Free Visual Relocalization: Metric Pose Relative to a Single Image -- Self-Distilled Feature Aggregation for Self-Supervised Monocular Depth Estimation -- Planes vs. …”
    SpringerLink - Click here for access
    Conference Proceeding eBook