root, Author at BytesArchive

February 10, 2024

Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images

This article presents a new approach to learn geometries such as depth and surface normal from images….

February 10, 2024

The paper discusses CREMA, a modality-fusion framework aimed at enhancing the efficiency of multimodal compositional video reasoning….

February 10, 2024

This article presents Mamba-ND, a design that extends the Mamba architecture to multi-dimensional data. Transformers, the de-facto…

February 10, 2024

The Segment Anything Model (SAM) is a popular image processing tool for its segmentation accuracy, variety of…

February 10, 2024

The article introduces a novel method called Point-VOS for Video Object Segmentation (VOS). Traditional VOS methods require…

February 10, 2024

The paper presents a novel method for generating PBR images directly, avoiding the challenges and inaccuracies associated…

February 10, 2024

The article introduces SPHINX-X, a Multi-modality Large Language Model (MLLM) series, which is an enhancement of the…

February 10, 2024

The study presents InstaGen, a novel method to enhance object detector’s ability by training on synthetic dataset…

February 9, 2024

The study aimed to identify machine learning models that could efficiently categorize tweets concerning eating disorders. Over…

February 9, 2024

This research focuses on understanding the information encoded in speech processing by using vector representations of speech…