+

Research Posts

Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and Forecasts

Papers with Code
Reporter Kate Martin

By Kate Martin

Posted on: November 15, 2024

Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and Forecasts

The Caravan large-sample hydrology dataset (Kratzert et al., 2023) was created to standardize and harmonize streamflow data from various regional datasets, combined with globally available meteorological forcing and catchment attributes. This community-driven project also allows researchers to conve...

Read More

Scaling Mesh Generation via Compressive Tokenization

Papers with Code
Reporter Kate Martin

By Kate Martin

Posted on: November 13, 2024

Scaling Mesh Generation via Compressive Tokenization

We propose a compressive yet effective mesh representation, Blocked and Patchified Tokenization (BPT), facilitating the generation of meshes exceeding 8k faces. BPT compresses mesh sequences by employing block-wise indexing and patch aggregation, reducing their length by approximately 75\% compared ...

Read More

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Papers with Code
Reporter Naomi Wilson

By Naomi Wilson

Posted on: November 13, 2024

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Language models have shown impressive performance on tasks within their training distribution, but often struggle with novel problems requiring complex reasoning. We investigate the effectiveness of test-time training (TTT) -- updating model parameters temporarily during inference using a loss deriv...

Read More

CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Papers with Code
Reporter Kate Martin

By Kate Martin

Posted on: November 13, 2024

CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

In complex scenes and varied conditions, effectively integrating spatial-temporal context is crucial for accurately identifying changes. However, current RS-CD methods lack a balanced consideration of performance and efficiency. CNNs lack global context, Transformers have quadratic computational com...

Read More

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Papers with Code
Reporter Kate Martin

By Kate Martin

Posted on: November 11, 2024

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Recent research on the 1-bit Large Language Models (LLMs), such as BitNet b1.58, presents a promising direction for reducing the inference cost of LLMs while maintaining their performance. In this work, we introduce BitNet a4.8, enabling 4-bit activations for 1-bit LLMs. BitNet a4.8 employs a hybrid...

Read More

Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research

Papers with Code
Reporter Naomi Wilson

By Naomi Wilson

Posted on: November 11, 2024

Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research

In recent years, the application of generative artificial intelligence (GenAI) in financial analysis and investment decision-making has gained significant attention. However, most existing approaches rely on single-agent systems, which fail to fully utilize the collaborative potential of multiple AI...

Read More

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views

Papers with Code
Reporter Javier Vásquez

By Javier Vásquez

Posted on: November 11, 2024

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views

We introduce MVSplat360, a feed-forward approach for 360{\deg} novel view synthesis (NVS) of diverse real-world scenes, using only sparse observations. This setting is inherently ill-posed due to minimal overlap among input views and insufficient visual information provided, making it challenging fo...

Read More

Convolutional Differentiable Logic Gate Networks

Papers with Code
Reporter Naomi Wilson

By Naomi Wilson

Posted on: November 11, 2024

Convolutional Differentiable Logic Gate Networks

With the increasing inference cost of machine learning models, there is a growing interest in models with fast and efficient inference. Recently, an approach for learning logic gate networks directly via a differentiable relaxation was proposed. Logic gate networks are faster than conventional neura...

Read More