Search Results - (((((( Human-school relationships ) OR ( Human-social relations ))) OR ( Human-tribal nations ))) OR ( Human-((fiscal relations) OR (local relations)) ))

Refine Results
  1. 1

    Motion history images for action recognition and understanding by Ahad, Md. Atiqur Rahman

    Published: Springer, 2013
    Description: 1 online resource (131 pages).
    SpringerLink - Click here for access
    eBook
  2. 2

    Computer vision systems : 12th international conference, ICVS 2019, Thessaloniki, Greece, September 23-25, 2019, proceedings by ICVS (Conference : Computer vision systems) Thessalonikē, Greece), SpringerLink (Online service)

    Published: Springer, 2019
    Description: 1 online resource (xviii, 799 pages) : illustrations (some color).
    Contents:
    SpringerLink - Click here for access
    Conference Proceeding eBook
  3. 3

    Perception and machine intelligence : first Indo-Japan Conference, PerMIn 2012, Kolkata, India, January 12-13, 2012, proceedings by PerMIn (Conference) Kolkata, India), SpringerLink (Online service)

    Published: Springer, 2012
    Description: 1 online resource (xvii, 380 pages) : illustrations.
    Contents: “…Smart, Sensing System for Human Emotion and Behaviour Recognition /…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  4. 4
  5. 5
  6. 6
  7. 7
  8. 8

    Pattern Recognition : ICPR International Workshops and Challenges, virtual event, January 10-15, 2021, proceedings. Part II by International Conference on Pattern Recognition Online, SpringerLink (Online service)

    Published: Springer, 2021
    Description: 1 online resource (xx, 753 pages) : illustrations (some color).
    Contents: “…Mutual Use of Semantics and Geometry for CNN-Based Object Localization in ToF Images /…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  9. 9

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, proceedings. Part XXII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 773 pages) : illustrations (chiefly color).
    Contents: “…ByteTrack: Multi-Object Tracking by Associating Every Detection Box -- Robust Multi-Object Tracking by Marginal Inference -- PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking? -- Particle Video Revisited: Tracking through Occlusions Using Point Trajectories -- Tracking Objects As Pixel-Wise Distributions -- CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds -- Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline -- Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting -- AiATrack: Attention in Attention for Transformer Visual Tracking -- Disentangling Architecture and Training for Optical Flow -- A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow -- Robust Landmark-Based Stent Tracking in X-Ray Fluoroscopy -- Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations -- Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction -- Diverse Human Motion Prediction Guided by Multi-level Spatial- Temporal Anchors -- Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction -- Sequential Multi-View Fusion Network for Fast LiDAR Point Motion Estimation -- E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs -- Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving -- Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework -- MotionCLIP: Exposing Human Motion Generation to CLIP Space -- Backbone Is All Your Need: A Simplified Architecture for Visual Object Tracking -- Aware of the History: Trajectory Forecasting with the Local Behavior Data -- Optical Flow Training under Limited Label Budget via Active Learning -- Hierarchical Feature Embedding for Visual Tracking -- Tackling Background Distraction in Video Object Segmentation -- Social-Implicit: Rethinking Trajectory Prediction Evaluation and the Effectiveness of Implicit Maximum Likelihood Estimation -- TEMOS: Generating Diverse Human Motions from Textual Descriptions -- Tracking Every Thing in the Wild -- HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance -- Towards Sequence-Level Training for Visual Tracking -- Learned Monocular Depth Priors in Visual-Inertial Initialization -- Robust Visual Tracking by Segmentation -- MeshLoc: Mesh-Based Visual Localization -- S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction -- Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization -- FEAR: Fast, Efficient, Accurate and Robust Visual Tracker -- PREF: Predictability Regularized Neural Motion Fields -- View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums -- HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking -- RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer -- SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image -- Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  10. 10

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part IV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…-- Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition -- Dual-Evidential Learning for Weakly-Supervised Temporal Action Localization -- Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning -- AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition -- Panoramic Human Activity Recognition -- Delving into Details: Synopsis-to-Detail Networks for Video Recognition -- A Generalized & Robust Framework for Timestamp Supervision in Temporal Action Segmentation -- Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning -- PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens -- Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection -- Compound Prototype Matching for Few-Shot Action Recognition -- Continual 3D Convolutional Neural Networks for Real-Time Processing of Videos -- Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition -- Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection -- Action Quality Assessment with Temporal Parsing Transformer -- Entry-Flipped Transformer for Inference and Prediction of Participant Behavior -- Pairwise Contrastive Learning Network for Action Quality Assessment -- Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos -- ActionFormer: Localizing Moments of Actions with Transformers -- SocialVAE: Human Trajectory Prediction Using Timewise Latents -- Shape Matters: Deformable Patch Attack -- Frequency Domain Model Augmentation for Adversarial Attack -- Prior-Guided Adversarial Initialization for Fast Adversarial Training -- Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation -- LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity -- A Large-Scale Multiple-Objective Method for Black-Box Attack against Object Detection -- GradAuto: Energy-Oriented Attack on Dynamic Neural Networks -- A Spectral View of Randomized Smoothing under Common Corruptions: Benchmarking and Improving Certified Robustness -- Improving Adversarial Robustness of 3D Point Cloud Classification Models -- Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number -- RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN -- Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  11. 11

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part I by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 747 pages) : illustrations (chiefly color).
    Contents: “…Learning Depth from Focus in the Wild -- Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World -- An End-to-End Transformer Model for Crowd Localization -- Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network -- DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection -- Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation -- Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects -- Lidar Point Cloud Guided Monocular 3D Object Detection -- Structural Causal 3D Reconstruction -- 3D Human Pose Estimation Using Mbius Graph Convolutional Networks -- Learning to Train a Point Cloud Reconstruction Network without Matching -- PanoFormer: Panorama Transformer for Indoor 360 Depth Estimation -- Self-supervised Human Mesh Recovery with Cross-Representation Alignment -- AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction -- A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation -- PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo -- Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency -- Towards Comprehensive Representation Enhancement in Semantics- Guided Self-Supervised Monocular Depth Estimation -- AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture -- Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers -- GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping -- Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion -- GitNet: Geometric Prior-Based Transformation for Birds-Eye View Segmentation -- Learning Visibility for Robust Dense Human Body Estimation -- Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes -- CompNVS: Novel View Synthesis with Scene Completion -- SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling -- LocalBins: Improving Depth Estimation by Learning Local Distributions -- 2D GANs Meet Unsupervised Single-View 3D Reconstruction -- InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images -- Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors -- Bilateral Normal Integration -- S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning -- SC-wLS: Towards Interpretable Feed-Forward Camera Re-localization -- FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras -- DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image -- 3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform -- RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation -- Monocular 3D Object Reconstruction with GAN Inversion -- Map-Free Visual Relocalization: Metric Pose Relative to a Single Image -- Self-Distilled Feature Aggregation for Self-Supervised Monocular Depth Estimation -- Planes vs. …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  12. 12
  13. 13

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXII by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 741 pages) : illustrations (chiefly color).
    Contents: “…ARAH: Animatable Volume Rendering of Articulated Human SDFs -- ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer -- NDF: Neural Deformable Fields for Dynamic Human Modelling -- Neural Density-Distance Fields -- NeXT: Towards High Quality Neural Radiance Fields via Multi-Skip Transformer -- Learning Online Multi-sensor Depth Fusion -- BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-Scale Scene Rendering -- Decomposing the Tangent of Occluding Boundaries according to Curvatures and Torsions -- NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors -- Generalizable Patch-Based Neural Rendering -- Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation -- Real-Time Neural Character Rendering with Pose-Guided Multiplane Images -- SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views -- Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth -- Depth Field Networks for Generalizable Multi-View Scene Representation -- Context-Enhanced Stereo Transformer -- PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching -- Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images -- Latency-Aware Collaborative Perception -- TensoRF: Tensorial Radiance Fields -- NeFSAC: Neurally Filtered Minimal Samples -- SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data -- HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields -- NeuMan: Neural Human Radiance Field from a Single Video -- TAVA: Template-Free Animatable Volumetric Actors -- EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching -- Relative Pose from SIFT Features -- Selection and Cross Similarity for Event-Image Deep Stereo -- D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding -- CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-Scale Indoor Scene -- ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild -- 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding -- Few Zero Level Set-Shot Learning of Shape Signed Distance Functions in Feature Space -- Solution Space Analysis of Essential Matrix Based on Algebraic Error Minimization -- Approximate Differentiable Rendering with Algebraic Surfaces -- CoVisPose: Co-Visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360 Indoor Panoramas -- Affine Correspondences between Multi-Camera Systems for 6DOF Relative Pose Estimation -- GraphFit: Learning Multi-Scale Graph-Convolutional Representation for Point Cloud Normal Estimation -- IS-MVSNet: Importance Sampling-Based MVSNet -- Point Scene Understanding via Disentangled Instance Mesh Reconstruction -- DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras -- Space-Partitioning RANSAC.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  14. 14

    Computer vision - ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXIV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource (lvi, 763 pages) : illustrations (chiefly color).
    Contents: “…-- Attention Diversification for Domain Generalization -- ESS: Learning Event-Based Semantic Segmentation from Still Images -- An Efficient Spatio-Temporal Pyramid Transformer for Action Detection -- Human Trajectory Prediction via Neural Social Physics -- Towards Open Set Video Anomaly Detection -- ECLIPSE: Efficient Long-Range Video Retrieval Using Sight and Sound -- Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing -- Less than Few: Self-Shot Video Instance Segmentation -- Adaptive Face Forgery Detection in Cross Domain -- Real-Time Online Video Detection with Temporal Smoothing Transformers -- TALLFormer: Temporal Action Localization with a Long-Memory Transformer -- Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation -- TL;DW? …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  15. 15

    Pattern recognition and computer vision : 4th Chinese Conference, PRCV 2021, Beijing, China, October 29-November 1, 2021, Proceedings. Part I by PRCV (Conference) Beijing, China), SpringerLink (Online service)

    Published: Springer, 2021
    Description: 1 online resource (xix, 617 pages) : illustrations (some color).
    Contents: “….-3D Multi-Object Detection and Tracking with Sparse Stationary LiDAR -- CRNet: Centroid Radiation Network for Temporal Action Localization -- Weakly Supervised Temporal Action Localization with Segment-Level Labels -- Locality-constrained collaborative representation with multi-resolution dictionary for face recognition -- Fast and Fusion: Real-time Pedestrian Detector Boosted by Body-head Fusion -- STA-GCN: Spatio-Temporal AU Graph Convolution Network for Facial Micro-Expression Recognition -- Attentive Contrast Learning Network for Fine-grained Classification -- Relation-Based Knowledge Distillation for Anomaly Detection -- High Power-efficient and Performance-density FPGA Accelerator for CNN-based Object Detection -- Relation-Guided Actor Attention for Group Activity Recognition -- MVAD-Net: Learning View-Aware and Domain-Invariant Representation for Baggage Re-Identification -- Joint Attention Mechanism for Unsupervised Video Object Segmentation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  16. 16

    Image analysis and recognition : 12th International Conference, ICIAR 2015, Niagara Falls, ON, Canada, July 22-24, 2015, Proceedings by ICIAR (Conference) Niagara Falls, Ont.), SpringerLink (Online service)

    Published: Springer, 2015
    Description: 1 online resource (xviii, 543 pages) : illustrations.
    Contents: “…Modelling of Subjective Radiological Assessments with Objective Image Quality Measures of Brain and Body CT Images -- Blind Image Quality Assessment Through Wakeby Statistics Model -- Improving Image Quality of Tiled Displays -- Structural Similarity-Based Optimization Problems with L1-Regularization: Smoothing Using Mollifiers -- Improved Non-Local Means Algorithm Based on Dimensionality Reduction -- Non-local Means for Stereo Image Denoising Using Structural Similarity -- Structural Similarity Optimized Wiener Filter: A Way to Fight Image Noise -- A Real-Time Framework for Detection of Long Linear Infrastructural Objects in Aerial Imagery -- Structural Representations for Multi-modal Image Registration Based on Modified Entropy -- Attributed Relational Graph-Based Learning of Object Models for Object Segmentation -- Label Fusion for Multi-atlas Segmentation Based on Majority Voting -- An Optimized Selective Encryption for Video Confidentiality -- Near-Lossless PCA-Based Compression of Seabed Surface with Prediction -- Adaptive Weighted Neighbors Lossless Image Coding -- Dimensionality Reduction of Proportional Data Through Data Separation Using Dirichlet Distribution -- Image Categorization Using a Heuristic Automatic Clustering Method Based on Hierarchical Clustering -- Semantic Scene Classification with Generalized Gaussian Mixture Models -- Classification of Tooth Shapes for Human Identification Purposes -- An Experimental Comparison of Selected Simple Shape Descriptors -- Micro Genetic and Evolutionary Feature Extraction: An Exploratory Data Analysis Approach for Multispectral Iris Recognition -- Biometric Analysis of Human Ear Matching Using Scale and Rotation Invariant Feature Detectors -- Mutibiometric System Based on Game Theory -- Head Pose Classification Using a Bidimensional Correlation Filter -- Illumination Robust Facial Feature Detection via Decoupled Illumination and Texture Features -- Posed Facial Expression Detection Using Reflection Symmetry and Structural Similarity -- Improving the Recognition of Occluded Faces by Means of Two-Dimensional Orthogonal Projection into Local Subspaces -- Hybrid Age Estimation Using Facial Images -- Unsupervised Sub-graph Selection and Its Application in Face Recognition Techniques -- Dynamic Perceptual Attribute-Based Hidden Conditional Random Fields for Gesture Recognition -- The Bag of Micro-Movements for Human Activity Recognition -- An Efficient Method for Extracting Key-Frames from 3D Human Joint Locations for Action Recognition -- A Simple View-Based Software Architecture for an Autonomous Robot Navigation System -- A Comparison of Feature Detectors and Descriptors in RGB-D SLAM Methods -- Accuracy Improvement for Depth from Small Irregular Camera Motions and Its Performance Evaluation. …”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  17. 17
  18. 18

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part XXXV by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency -- Leveraging Action Affinity and Continuity for Semi-Supervised Temporal Action Segmentation -- Spotting Temporally Precise, Fine-Grained Events in Video -- Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation -- Efficient Video Transformers with Spatial-Temporal Token Selection -- Long Movie Clip Classification with State-Space Video Models -- Prompting Visual-Language Models for Efficient Video Understanding -- Asymmetric Relation Consistency Reasoning for Video Relation Grounding -- Self-Supervised Social Relation Representation for Human Group Detection -- K-Centered Patch Sampling for Efficient Video Recognition -- A Deep Moving-Camera Background Model -- GraphVid: It Only Takes a Few Nodes to Understand a Video -- Delta Distillation for Efficient Video Processing -- MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning -- COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality -- E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context -- TDViT: Temporal Dilated Video Transformer for Dense Video Tasks -- Semi-Supervised Learning of Optical Flow by Flow Supervisor -- Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization -- Deep 360 Optical Flow Estimation Based on Multi-Projection Fusion -- MaCLR: Motion-Aware Contrastive Learning of Representations for Videos -- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection -- Frozen CLIP Models Are Efficient Video Learners -- PIP: Physical Interaction Prediction via Mental Simulation with Span Selection -- Panoramic Vision Transformer for Saliency Detection in 360 Videos -- Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration -- Motion Sensitive Contrastive Learning for Self-Supervised Video Representation -- Dynamic Temporal Filtering In Video Models -- Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification -- Temporal Lift Pooling for Continuous Sign Language Recognition -- MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes -- SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding -- Cross-Modal Prototype Driven Network for Radiology Report Generation -- TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts -- SeqTR: A Simple Yet Universal Network for Visual Grounding -- VTC: Improving Video-Text Retrieval with User Comments -- FashionViL: Fashion-Focused Vision-and-Language Representation Learning -- Weakly Supervised Grounding for VQA in Vision-Language Transformers -- Automatic Dense Annotation of Large-Vocabulary Sign Language Videos -- MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval -- GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval -- A Simple and Robust Correlation Filtering Method for Text-Based Person Search.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  19. 19

    Pattern recognition and computer vision : 5th Chinese Conference, PRCV 2022, Shenzhen, China, October 14-17, 2022, proceedings. Part III by PRCV (Conference) Shenzhen Shi, China), SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource.
    Contents: “…3D Computer Vision and Reconstruction, Robots and Autonomous Driving -- Locally Geometry-Aware Improvements of LOP for Ecient Skeleton Extraction -- Spherical Transformer: Adapting Spherical Signal to Convolutional Networks -- Waterfall-Net: Waterfall Feature Aggregation for Point Cloud Semantic Segmentation -- Sparse LiDAR and Binocular Stereo Fusion Network for 3D Object Detection -- Full Head Performance Capture Using Multi-Scale Mesh Propagation -- Learning Cross-domain Features for Domain Generalization on Point Clouds -- Unsupervised Pre-training for 3D Object Detection with Transformer -- Global Patch Cross-Attention for Point Cloud Analysis -- EEP-Net: Enhancing local neighborhood features and Ecient semantic segmentation of scale Point Clouds -- CARR-Net: Leveraging on Subtle Variance of Neighbors for Point Cloud Semantic Segmentation -- 3D Meteorological Radar Data Visualization with Point Cloud Completion and Poisson Surface Reconstruction -- JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario -- A Single-pathway Biomimetic Model for Potential Collision Prediction -- PilotAttnNet: Multi-Modal Attention Network for End-to-End Steering Control -- Stochastic Navigation Command Matching for Imitation Learning of a Driving Policy -- Recognition, Remote Sensing -- Group Activity Representation Learning with Self-Supervised Predictive Coding -- Skeleton-Based Action Quality Assessment via Partially Connected LSTM with Triplet Losses -- Hierarchical Long-Short Transformer for Group Activity Recognition -- GNN-based structural dynamics simulation for modular buildings -- Semantic-Augmented Local Decision Aggregation Network for Action Recognition -- Consensus-Guided Keyword Targeting for Video Captioning -- Handwritten Mathematical Expression Recognition via GCAttention- Based Encoder and Bidirectional Mutual Learning Transformer -- Semi- and Self-Supervised Learning for Scene Text Recognition with Fewer Labels -- TMCR: A Twin Matching Networks for Chinese Scene Text Retrieval -- Thai Scene Text Recognition with Character Combination -- Automatic Examination Paper Scores Calculation and Grades Analysis Based on OpenCV -- Efficient License Plate Recognition via Parallel Position-aware Attention -- Semantic-Aware Non-Local Network for Handwritten Mathematical Expression Recognition -- Math Word Problem Generation with Memory Retrieval -- Traditional Mongolian Script Standard Compliance Testing Based on Deep Residual Network and Spatial Pyramid Pooling -- FOV Recognizer: Telling the Field of View of Movie Shots -- Multi-Level Temporal Relation Graph for Continuous Sign Language Recognition.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook
  20. 20

    Computer vision -- ECCV 2022 : 17th European Conference, Tel Aviv, Israel, October 23-27, 2022 : proceedings. Part V by European Conference on Computer Vision Tel Aviv, Israel, SpringerLink (Online service)

    Published: Springer, 2022
    Description: 1 online resource : illustrations (black and white).
    Contents: “…Adaptive Image Transformations for Transfer-Based Adversarial Attack -- Generative Multiplane Images: Making a 2D GAN 3D-Aware -- AdvDO: Realistic Adversarial Attacks for Trajectory Prediction -- Adversarial Contrastive Learning via Asymmetric InfoNCE -- One Size Does NOT Fit All: Data-Adaptive Adversarial Training -- UniCR: Universally Approximated Certified Robustness via Randomized Smoothing -- Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips -- Robust Network Architecture Search via Feature Distortion Restraining -- SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination -- Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack -- Data-Free Backdoor Removal Based on Channel Lipschitzness -- Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack -- Learning Energy-Based Models with Adversarial Training -- Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation -- Revisiting Outer Optimization in Adversarial Training -- Zero-Shot Attribute Attacks on Fine-Grained Recognition Models -- Towards Effective and Robust Neural Trojan Defenses via Input Filtering -- Scaling Adversarial Training to Large Perturbation Bounds -- Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack -- Generative Domain Adaptation for Face Anti-Spoofing -- MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition -- GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality -- UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection -- Effective Presentation Attack Detection Driven by Face Related Task -- PPT: Token-Pruned Pose Transformer for Monocular and Multi-View Human Pose Estimation -- AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing -- P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation -- D&D: Learning Human Dynamics from Dynamic Camera -- Explicit Occlusion Reasoning for Multi-Person 3D Human Pose Estimation -- COUCH: Towards Controllable Human-Chair Interactions -- Identity-Aware Hand Mesh Estimation and Personalization from RGB Images -- C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation -- Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields -- CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation -- DeciWatch: A Simple Baseline for 10O Efficient 2D and 3D Pose Estimation -- SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos -- PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation -- Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement -- Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction -- Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation -- Audio-Driven Stylized Gesture Generation with Flow-Based Model -- Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation.…”
    SpringerLink - Click here for access
    Conference Proceeding eBook