Buyer‐Seller Model Design

Buyer-Seller Model Design

Overview

This project uses a dual-agent architecture in which two separate models—buyer and seller—learn independently while being interdependent. Their coordination enables trading behavior to emerge from two distinct perspectives.

Why Separate Buyer and Seller?

The buying decision and the selling decision are fundamentally different in nature.
Input features and relevant timing considerations differ.
Training a single model to output Buy, Sell, and Hold decisions requires it to learn two logically opposing behaviors, which may be inefficient or unstable.

Instead, we train:

A buyer model that learns when to enter.
A seller model that learns when to exit.

Reward Interdependence

Each agent’s reward depends on the other's decision:

The buyer’s reward depends on how well the seller exits.
The seller’s reward depends on the quality of the entry chosen by the buyer.

This mutual dependence results in a cooperative learning dynamic. Improvements in one agent can lead to improved learning signals for the other.

Is This Multi-Agent Reinforcement Learning?

Partially. While it resembles multi-agent RL due to multiple policies and shared consequences, there are key differences:

The buyer and seller do not act simultaneously in a shared environment.
Their actions are temporally separated (entry and exit).
It is better described as a role decomposition of a single-agent task, rather than true multi-agent interaction.

Alternative Approach: Unified Agent

We also considered using one agent with three actions:

Buy
Sell
Hold

However:

It introduces action constraints (e.g., cannot sell without having bought).
It increases the complexity of learning.
It lacks the clean separation of roles, making interpretability harder.

Benefits of the Current Setup

Role clarity for model behavior
Modular and easier to debug or adjust
Encourages specialized learning in each part of the decision process

Future Directions

Empirical comparison with unified-agent approach
Exploration of shared representations between buyer and seller
Potential extensions to market-making or pair-trading scenarios

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Buyer‐Seller Model Design

Buyer-Seller Model Design

Overview

Why Separate Buyer and Seller?

Reward Interdependence

Is This Multi-Agent Reinforcement Learning?

Alternative Approach: Unified Agent

Benefits of the Current Setup

Future Directions

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally