Skip to content

CoRAL-ASU/getreason

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

24 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning

GETReason Logo Paper Code

A structured multi-agent framework for extracting rich, contextual narratives from public event images

Paper | arXiv | Code | Dataset | Website

πŸ“Š Presentation Slides | πŸŽ₯ Video | πŸ“„ Poster


🎯 Overview

GETReason is a novel framework that goes beyond surface-level image descriptions to infer deeper contextual meaning from publicly significant event images. Our approach uses a hierarchical multi-agent reasoning system to extract geospatial, temporal, and event-specific information, enabling comprehensive understanding of visual narratives.

Key Features

  • πŸ” Multi-Agent Architecture: Specialized agents for geospatial, temporal, and event reasoning
  • πŸ”„ Cross-Generation: Collaborative validation between agents for enhanced accuracy
  • πŸ“Š GREAT Metric: Novel evaluation metric for reasoning quality assessment
  • 🎯 Event Understanding: Focus on sociopolitical significance rather than just visual content
  • πŸ“ˆ Robust Performance: Substantial improvements over existing captioning and reasoning baselines

πŸ—οΈ Architecture

GETReason Architecture

GETReason's hierarchical multi-agent reasoning framework

Agent Specialization

  • 🌍 Geospatial Agent: Infers location, country, city, and geographic context
  • ⏰ Temporal Agent: Extracts dates, periods, and historical context
  • 🎭 Event Agent: Identifies events, political significance, and sociopolitical context

πŸ“Š Results

Our framework demonstrates significant improvements:

  • Enhanced Accuracy: Better geospatial and temporal inference
  • Reduced Hallucinations: Structured approach minimizes misleading information
  • Improved Generalization: Robust performance across diverse event types
  • Contextual Understanding: Deeper insights into event significance

πŸ—‚οΈ Repository Structure

getreason/
β”œβ”€β”€ README.md                 # This file - Project overview
β”œβ”€β”€ code/                     # Implementation and experiments
β”‚   β”œβ”€β”€ README.md            # Detailed setup and usage instructions
β”‚   β”œβ”€β”€ gpt_workbench.ipynb  # GPT-4o-mini experiments
β”‚   β”œβ”€β”€ gemini_workbench.ipynb # Gemini experiments
β”‚   β”œβ”€β”€ dataset/             # Augmented datasets
β”‚   β”‚   β”œβ”€β”€ gold_tara.jsonl  # TARA dataset (11,240 samples)
β”‚   β”‚   └── gold_wikitilo.jsonl # WikiTilo dataset (6,296 samples)
β”‚   └── assets/              # Prompts, schemas, and data
└── Paper/                   # Research paper implementation
    └── GETReason/           # Single image workflow

πŸš€ Quick Start

For detailed setup and usage instructions, see the code directory:

# Clone the repository
git clone https://github.com/coral-lab-asu/getreason.git
cd getreason

# Navigate to code directory for implementation
cd code

# Follow the detailed setup instructions in code/README.md

πŸ“š Datasets

We provide two augmented datasets with comprehensive annotations:

TARA Dataset

  • Size: 11,240 samples
  • Content: Rich event information with reasoning
  • Coverage: News events from 2010-2021

WikiTilo Dataset

  • Size: 6,296 samples
  • Content: Temporal and geospatial information
  • Coverage: Historical events from 1826-2021

πŸ”¬ Research Impact

GETReason addresses critical challenges in:

  • πŸ“° Journalism: Automated event understanding for news analysis
  • πŸ“š Education: Historical context extraction for educational content
  • πŸ›οΈ Archival Analysis: Systematic organization of event imagery
  • πŸ” Fact-Checking: Reliable extraction of contextual information

πŸ“„ Citation

If you use this work in your research, please cite:

@inproceedings{siingh-etal-2025-getreason,
    title = "{GETR}eason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning",
    author = "Siingh, Shikhhar  and
      Rawat, Abhinav  and
      Baral, Chitta  and
      Gupta, Vivek",
    booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.acl-long.1439/",
    doi = "10.18653/v1/2025.acl-long.1439",
    pages = "29779--29800"
}

πŸ‘₯ Authors

  • Shikhhar Siingh - Arizona State University
  • Abhinav Rawat - Arizona State University
  • Chitta Baral - Arizona State University
  • Vivek Gupta - Arizona State University

πŸ“ž Contact

For questions, or issues, or collaboration:

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning

ACL 2025 arXiv

About

GETReason is a novel framework that goes beyond surface-level image descriptions to infer deeper contextual meaning from publicly significant event images. Our approach uses a hierarchical multi-agent reasoning system to extract geospatial, temporal, and event-specific information, enabling comprehensive understanding of visual narratives.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors