Search Results - (((( Human-((spatial relationships) OR (spatial relations)) ) OR ( Human-social relationships ))) OR ((((( Human-clinical relationships ) OR ( Human-animal population ))) OR ( Human-tribal relations ))))*

  • Showing 1 - 10 results of 10
  1.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency -- Leveraging Action Affinity and Continuity for Semi-Supervised Temporal Action Segmentation -- Spotting Temporally Precise, Fine-Grained Events in Video -- Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation -- Efficient Video Transformers with Spatial-Temporal Token Selection -- Long Movie Clip Classification with State-Space Video Models -- Prompting Visual-Language Models for Efficient Video Understanding -- Asymmetric Relation Consistency Reasoning for Video Relation Grounding -- Self-Supervised Social Relation Representation for Human Group Detection -- K-Centered Patch Sampling for Efficient Video Recognition -- A Deep Moving-Camera Background Model -- GraphVid: It Only Takes a Few Nodes to Understand a Video -- Delta Distillation for Efficient Video Processing -- MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning -- COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality -- E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context -- TDViT: Temporal Dilated Video Transformer for Dense Video Tasks -- Semi-Supervised Learning of Optical Flow by Flow Supervisor -- Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization -- Deep 360 Optical Flow Estimation Based on Multi-Projection Fusion -- MaCLR: Motion-Aware Contrastive Learning of Representations for Videos -- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection -- Frozen CLIP Models Are Efficient Video Learners -- PIP: Physical Interaction Prediction via Mental Simulation with Span Selection -- Panoramic Vision Transformer for Saliency Detection in 360 Videos -- Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration -- Motion Sensitive Contrastive Learning for Self-Supervised Video Representation -- 
Dynamic Temporal Filtering In Video Models -- Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification -- Temporal Lift Pooling for Continuous Sign Language Recognition -- MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes -- SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding -- Cross-Modal Prototype Driven Network for Radiology Report Generation -- TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts -- SeqTR: A Simple Yet Universal Network for Visual Grounding -- VTC: Improving Video-Text Retrieval with User Comments -- FashionViL: Fashion-Focused Vision-and-Language Representation Learning -- Weakly Supervised Grounding for VQA in Vision-Language Transformers -- Automatic Dense Annotation of Large-Vocabulary Sign Language Videos -- MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval -- GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval -- A Simple and Robust Correlation Filtering Method for Text-Based Person Search.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  2.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part IV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…-- Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition -- Dual-Evidential Learning for Weakly-Supervised Temporal Action Localization -- Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning -- AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition -- Panoramic Human Activity Recognition -- Delving into Details: Synopsis-to-Detail Networks for Video Recognition -- A Generalized & Robust Framework for Timestamp Supervision in Temporal Action Segmentation -- Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning -- PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens -- Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection -- Compound Prototype Matching for Few-Shot Action Recognition -- Continual 3D Convolutional Neural Networks for Real-Time Processing of Videos -- Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition -- Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection -- Action Quality Assessment with Temporal Parsing Transformer -- Entry-Flipped Transformer for Inference and Prediction of Participant Behavior -- Pairwise Contrastive Learning Network for Action Quality Assessment -- Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos -- ActionFormer: Localizing Moments of Actions with Transformers -- SocialVAE: Human Trajectory Prediction Using Timewise Latents -- Shape Matters: Deformable Patch Attack -- Frequency Domain Model Augmentation for Adversarial Attack -- Prior-Guided Adversarial Initialization for Fast Adversarial Training -- Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation -- LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity -- A Large-Scale Multiple-Objective Method for Black-Box Attack against Object Detection -- GradAuto: Energy-Oriented Attack on Dynamic Neural Networks -- A Spectral View of Randomized Smoothing under Common Corruptions: Benchmarking and Improving Certified Robustness -- Improving Adversarial Robustness of 3D Point Cloud Classification Models -- Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number -- RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN -- Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  3.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints -- ViewFormer: NeRF-Free Neural Rendering from Few Images Using Transformers -- L-Tracing. …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  4.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, proceedings. Part XXII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 773 pages) : illustrations (chiefly color).
    Contents: “…ByteTrack: Multi-Object Tracking by Associating Every Detection Box -- Robust Multi-Object Tracking by Marginal Inference -- PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking? -- Particle Video Revisited: Tracking through Occlusions Using Point Trajectories -- Tracking Objects As Pixel-Wise Distributions -- CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds -- Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline -- Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting -- AiATrack: Attention in Attention for Transformer Visual Tracking -- Disentangling Architecture and Training for Optical Flow -- A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow -- Robust Landmark-Based Stent Tracking in X-Ray Fluoroscopy -- Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations -- Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction -- Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors -- Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction -- Sequential Multi-View Fusion Network for Fast LiDAR Point Motion Estimation -- E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs -- Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving -- Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework -- MotionCLIP: Exposing Human Motion Generation to CLIP Space -- Backbone Is All Your Need: A Simplified Architecture for Visual Object Tracking -- Aware of the History: Trajectory Forecasting with the Local Behavior Data -- Optical Flow Training under Limited Label Budget via Active Learning -- Hierarchical Feature Embedding for Visual Tracking -- Tackling Background Distraction in Video Object Segmentation -- 
Social-Implicit: Rethinking Trajectory Prediction Evaluation and the Effectiveness of Implicit Maximum Likelihood Estimation -- TEMOS: Generating Diverse Human Motions from Textual Descriptions -- Tracking Every Thing in the Wild -- HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance -- Towards Sequence-Level Training for Visual Tracking -- Learned Monocular Depth Priors in Visual-Inertial Initialization -- Robust Visual Tracking by Segmentation -- MeshLoc: Mesh-Based Visual Localization -- S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction -- Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization -- FEAR: Fast, Efficient, Accurate and Robust Visual Tracker -- PREF: Predictability Regularized Neural Motion Fields -- View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums -- HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking -- RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer -- SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image -- Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  5.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part V by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Adaptive Image Transformations for Transfer-Based Adversarial Attack -- Generative Multiplane Images: Making a 2D GAN 3D-Aware -- AdvDO: Realistic Adversarial Attacks for Trajectory Prediction -- Adversarial Contrastive Learning via Asymmetric InfoNCE -- One Size Does NOT Fit All: Data-Adaptive Adversarial Training -- UniCR: Universally Approximated Certified Robustness via Randomized Smoothing -- Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips -- Robust Network Architecture Search via Feature Distortion Restraining -- SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination -- Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack -- Data-Free Backdoor Removal Based on Channel Lipschitzness -- Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack -- Learning Energy-Based Models with Adversarial Training -- Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation -- Revisiting Outer Optimization in Adversarial Training -- Zero-Shot Attribute Attacks on Fine-Grained Recognition Models -- Towards Effective and Robust Neural Trojan Defenses via Input Filtering -- Scaling Adversarial Training to Large Perturbation Bounds -- Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack -- Generative Domain Adaptation for Face Anti-Spoofing -- MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition -- GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality -- UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection -- Effective Presentation Attack Detection Driven by Face Related Task -- PPT: Token-Pruned Pose Transformer for Monocular and Multi-View Human Pose Estimation -- AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing -- P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation -- D&D: Learning Human Dynamics from Dynamic Camera -- Explicit Occlusion Reasoning for Multi-Person 3D Human Pose Estimation -- COUCH: Towards Controllable Human-Chair Interactions -- Identity-Aware Hand Mesh Estimation and Personalization from RGB Images -- C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation -- Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields -- CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation -- DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation -- SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos -- PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation -- Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement -- Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction -- Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation -- Audio-Driven Stylized Gesture Generation with Flow-Based Model -- Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  6.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part VIII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvii, 751 pages) : illustrations (chiefly color).
    Contents: “…ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-Verified Image-Caption Associations for MS-COCO -- MOTCOM: The Multi-Object Tracking Dataset Complexity Metric -- How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset? …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  7.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 753 pages) : illustrations (chiefly color).
    Contents: “…Most and Least Retrievable Images in Visual-Language Query Systems -- Sports Video Analysis on Large-Scale Data -- Grounding Visual Representations with Texts for Domain Generalization -- Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions -- StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation -- VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance -- Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation -- End-to-End Active Speaker Detection -- Emotion Recognition for Multiple Context Awareness -- Adaptive Fine-Grained Sketch-Based Image Retrieval -- Quantized GAN for Complex Music Generation from Dance Videos -- Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction -- Localizing Visual Sounds the Easy Way -- Learning Visual Styles from Audio-Visual Associations -- Remote Respiration Monitoring of Moving Person Using Radio Signals -- Camera Pose Estimation and Localization with Active Audio Sensing -- PACS: A Dataset for Physical Audiovisual Commonsense Reasoning -- VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer -- Telepresence Video Quality Assessment -- MultiMAE: Multi-modal Multi-task Masked Autoencoders -- AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation -- AudioVisual Segmentation -- Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression -- Relationformer: A Unified Framework for Image-to-Graph Generation -- GAMa: Cross-view Video Geo-localization -- Revisiting a kNN-based Image Classification System with High-capacity Storage -- Geometric Representation Learning for Document Image Rectification -- S2-VER: Semi-Supervised Visual Emotion Recognition -- Image Coding for Machines with Omnipotent Feature Learning -- Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval -- 
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition -- Semantic-Guided Multi-Mask Image Harmonization -- Learning an Isometric Surface Parameterization for Texture Unwrapping -- Towards Regression-Free Neural Networks for Diverse Compute Platforms -- Relationship Spatialization for Depth Estimation -- Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models -- FAR: Fourier Aerial Video Recognition -- Translating a Visual LEGO Manual to a Machine-Executable Plan -- Fabric Material Recovery from Video Using Multi-Scale Geometric Auto-Encoder -- MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment -- The One Where They Reconstructed 3D Humans and Environments in TV Shows.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  8.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXVI by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (1 volume) : illustrations (black and white).
    SpringerLink - Click here for access
    Conference Proceeding eBook
  9.

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXIX by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 757 pages) : illustrations (chiefly color).
    SpringerLink - Click here for access
    Conference Proceeding eBook
  10.

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXIII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer Nature Switzerland, 2022
    Description: 1 online resource (1 volume) : illustrations (black and white).
    Contents: “…Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling -- Learning to Generate Realistic LiDAR Point Clouds -- RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds -- Diverse Image Inpainting with Normalizing Flow -- Improved Masked Image Generation with Token-Critic -- TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation -- Exploring Gradient-Based Multi-directional Controls in GANs -- Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition -- Neural Scene Decoration from a Single Photograph -- Outpainting by Queries -- Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes -- ChunkyGAN: Real Image Inversion via Segments -- GAN Cocktail: Mixing GANs without Dataset Access -- Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering -- Controllable Shadow Generation Using Pixel Height Maps -- Learning Where to Look Generative NAS Is Surprisingly Efficient -- Subspace Diffusion Generative Models -- DuelGAN: A Duel between Two Discriminators Stabilizes the GAN Training -- MINER: Multiscale Implicit Neural Representation -- An Embedded Feature Whitening Approach to Deep Neural Network Optimization -- Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization -- Self-Supervised Learning of Visual Graph Matching -- Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models -- QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving q-Norm Optimization Problem -- R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning -- Domain Generalization by Mutual-Information Regularization with Pre-trained Models -- Predicting Is Not Understanding: Recognizing and Addressing Underspecification in Machine Learning -- Neural-Sim: Learning to Generate Training Data with NeRF -- Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning -- Learned Variational Video Color Propagation -- Continual Variational Autoencoder Learning via Online Cooperative Memorization -- Learning to Learn with Smooth Regularization -- Incremental Task Learning with Incremental Rank Updates -- Batch-Efficient EigenDecomposition for Small and Medium Matrices -- Ensemble Learning Priors Driven Deep Unfolding for Scalable Video Snapshot Compressive Imaging -- Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method -- A Comparative Study of Graph Matching Algorithms in Computer Vision -- Improving Generalization in Federated Learning by Seeking Flat Minima -- Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not -- Transfer without Forgetting -- AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation -- Tackling Long-Tailed Category Distribution under Domain Shifts -- Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook