Simplify and generalize host control of machine #257

diegonehab · 2024-07-08T14:18:57Z

Context

Controlling a machine engaged with rollups involves reading and writing a number of different machine CSRs. The introduction of send_cmio_response simplified this, but it is still complicated.

run now returns the break_reason, which simplifies it further. But now there are unnecessary redundancies that may cause confusion.

Possible solutions

At the moment, iflags has fields PRV and the X, Y, and H flags.

PRV really is something internal that the host should never mess with (the current machine privilege level).

Let's promote iflags.PRV to its own full CSR iprv. This will simplify the state-access implementations, since they won't have to do field manipulation there anymore.
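To make the point about field manipulation concrete, here is a minimal sketch comparing the two access patterns. The bit positions, struct names, and helper names are assumptions made up for illustration; they are not the emulator's actual layout or state-access interface.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical layout, for illustration only: X, Y, H in the low bits, PRV above them.
constexpr int IFLAGS_PRV_SHIFT = 3;
constexpr uint64_t IFLAGS_PRV_MASK = uint64_t{3} << IFLAGS_PRV_SHIFT;

// Today: PRV shares a register with the X, Y, and H flags.
struct packed_state {
    uint64_t iflags = 0;
};

// Every access has to extract or splice a field.
inline uint64_t read_prv(const packed_state &s) {
    return (s.iflags & IFLAGS_PRV_MASK) >> IFLAGS_PRV_SHIFT;
}

inline void write_prv(packed_state &s, uint64_t prv) {
    assert(prv <= 3); // a privilege level fits in two bits
    s.iflags = (s.iflags & ~IFLAGS_PRV_MASK) | (prv << IFLAGS_PRV_SHIFT);
}

// Proposed: PRV promoted to its own CSR, called iprv here.
struct split_state {
    uint64_t iprv = 0;
};

// Accesses become plain register reads and writes.
inline uint64_t read_prv(const split_state &s) {
    return s.iprv;
}

inline void write_prv(split_state &s, uint64_t prv) {
    assert(prv <= 3);
    s.iprv = prv;
}
```

In the packed version a write to PRV must preserve the neighboring flag bits; in the split version there is nothing to preserve.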

X, Y, and H, on the other hand, are things the host needs to look at and, in the case of Y, change.

There is never a case in which more than one of these flags is set. They are always set via HTIF from the inside.

In the case of X and Y, the host will also need to look into htif.tohost. This is because htif.tohost contains the reason for the yield and the amount of data written to tx_buffer.

X is set when the machine returns from an automatic yield. Let's remove X altogether, since the machine run already returns this as a break reason and X is cleared automatically.

Y is set when the machine returns from a manual yield.

H is set when the machine is permanently halted.

Let's relocate Y and H to htif.tohost. With some reorganization there, we can make this happen.
(This will also simplify HTIF implementation, since it won't need to change the iflags register anymore.)
Let's rename Y to YM to make the distinction obvious. It's not a generic yield flag, but rather a Manual Yield flag.

Perhaps we can be smart and use the device+cmd fields together as "the flag", with a few changes to make them uniquely identify the halt and the manual yields.

There already are many WARL CSRs that prevent certain bits from being changed.
htif.tohost would be one of these. If H is set, it would remain set forever. I think we can even use a write to htif.fromhost to clear YM, saving the need to modify htif.tohost when returning from a manual yield.


diegonehab · 2024-07-08T15:53:35Z

After some thought, here is a possibility.

A machine is halted if tohost has dev=HTIF_DEV_HALT, cmd=HTIF_HALT_CMD_HALT, and (data & 1).
A machine is yielded manually if tohost has dev = HTIF_DEV_YIELD and cmd = HTIF_YIELD_CMD_MANUAL.

We change the part of the interpret loop that checks for fixed-point yield/halt to the following:

tohost = read_tohost();
if (halted(tohost)) { // dev=HTIF_DEV_HALT, cmd=HTIF_HALT_CMD_HALT, (data & 1) 
    return break_reason::halted;
}
if (yielded(tohost)) { // dev = HTIF_DEV_YIELD, cmd = HTIF_YIELD_CMD_MANUAL
    fromhost = read_fromhost();
    if (!yielded(fromhost)) { // unless host wrote a response to this htif-yield command...
        return break_reason::yielded_manually;
    } 
    // here we know the host responded, so we clear tohost and the machine is not yielded anymore
    write_tohost(0);
}
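For reference, a self-contained sketch of this check with the two predicates spelled out. The field layout (dev in bits 63-56, cmd in bits 55-48, data in bits 47-0), the numeric constant values, the `htif_regs` struct, and `break_reason::none` are assumptions for illustration and should be checked against the actual HTIF definitions.

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout: dev(8) | cmd(8) | data(48). Constant values are assumptions too.
constexpr int HTIF_DEV_SHIFT = 56;
constexpr int HTIF_CMD_SHIFT = 48;
constexpr uint64_t HTIF_DATA_MASK = (uint64_t{1} << HTIF_CMD_SHIFT) - 1;
constexpr uint64_t HTIF_DEV_HALT = 0;
constexpr uint64_t HTIF_DEV_YIELD = 2;
constexpr uint64_t HTIF_HALT_CMD_HALT = 0;
constexpr uint64_t HTIF_YIELD_CMD_MANUAL = 1;

inline uint64_t dev_of(uint64_t r) { return r >> HTIF_DEV_SHIFT; }
inline uint64_t cmd_of(uint64_t r) { return (r >> HTIF_CMD_SHIFT) & 0xff; }
inline uint64_t data_of(uint64_t r) { return r & HTIF_DATA_MASK; }

inline uint64_t pack(uint64_t dev, uint64_t cmd, uint64_t data) {
    assert(dev <= 0xff && cmd <= 0xff && data <= HTIF_DATA_MASK);
    return (dev << HTIF_DEV_SHIFT) | (cmd << HTIF_CMD_SHIFT) | data;
}

// dev=HTIF_DEV_HALT, cmd=HTIF_HALT_CMD_HALT, (data & 1)
inline bool halted(uint64_t r) {
    return dev_of(r) == HTIF_DEV_HALT && cmd_of(r) == HTIF_HALT_CMD_HALT && (data_of(r) & 1) != 0;
}

// dev=HTIF_DEV_YIELD, cmd=HTIF_YIELD_CMD_MANUAL
inline bool yielded(uint64_t r) {
    return dev_of(r) == HTIF_DEV_YIELD && cmd_of(r) == HTIF_YIELD_CMD_MANUAL;
}

// `none` is a hypothetical value meaning "keep interpreting".
enum class break_reason { none, halted, yielded_manually };

struct htif_regs {
    uint64_t tohost = 0;
    uint64_t fromhost = 0;
};

inline break_reason check_fixed_point(htif_regs &h) {
    if (halted(h.tohost)) {
        return break_reason::halted;
    }
    if (yielded(h.tohost)) {
        if (!yielded(h.fromhost)) { // host has not responded yet
            return break_reason::yielded_manually;
        }
        h.tohost = 0; // host responded, so the machine is no longer yielded
    }
    return break_reason::none;
}
```

Under these assumed values, a zeroed tohost decodes as dev=HTIF_DEV_HALT and cmd=HTIF_HALT_CMD_HALT, so the `(data & 1)` test is what keeps an idle register from reading as halted.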

We change the HTIF protocol to be as follows:

From the inside, to use HTIF, guest code writes dev+cmd+data to tohost. The HTIF device itself then clears fromhost. If the device is halt or yield, run() returns. From the outside, the host can check tohost to see what is up. To respond to a yield, the host copies dev+cmd to fromhost, changes the data as desired, and resumes the machine. If the device was yield and fromhost has the right combination of dev+cmd, the machine clears tohost. From the inside, guest code reads the response in fromhost.
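The round trip for a manual yield could be modeled roughly as follows. This is only a sketch: the register layout (dev+cmd in the top 16 bits), the `YIELD_MANUAL` value, and every function name here are hypothetical, and halt handling is left out.

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout: dev+cmd in the top 16 bits, data in the low 48 bits.
constexpr uint64_t DATA_MASK = (uint64_t{1} << 48) - 1;
constexpr uint64_t DEVCMD_MASK = ~DATA_MASK;
// Assumed encoding of dev=yield, cmd=manual.
constexpr uint64_t YIELD_MANUAL = (uint64_t{2} << 56) | (uint64_t{1} << 48);

struct htif {
    uint64_t tohost = 0;
    uint64_t fromhost = 0;
};

// Guest side: write dev+cmd+data to tohost. The device clears fromhost,
// so a stale response cannot be mistaken for an answer to this request.
inline void guest_request(htif &h, uint64_t devcmd, uint64_t data) {
    assert((devcmd & DATA_MASK) == 0 && data <= DATA_MASK);
    h.tohost = devcmd | data;
    h.fromhost = 0;
}

// Host side: copy dev+cmd from tohost into fromhost and set the response data.
inline void host_respond(htif &h, uint64_t data) {
    h.fromhost = (h.tohost & DEVCMD_MASK) | (data & DATA_MASK);
}

// Machine side, on resume: a matching response clears tohost.
// Returns true while the machine is still yielded.
inline bool still_yielded(htif &h) {
    const bool to_yield = (h.tohost & DEVCMD_MASK) == YIELD_MANUAL;
    const bool from_yield = (h.fromhost & DEVCMD_MASK) == YIELD_MANUAL;
    if (to_yield && from_yield) {
        h.tohost = 0;
        return false;
    }
    return to_yield;
}

// Guest side: read the data of the host's response.
inline uint64_t guest_read_response(const htif &h) {
    return h.fromhost & DATA_MASK;
}
```

In this model the host never writes tohost when returning from a manual yield; writing fromhost is enough.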

We also change write_tohost() to guard against the removal of a halted combination of dev+cmd even from the outside.
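One way such a guard might look, as a sketch. It assumes halt is encoded as dev=0, cmd=0 in the top 16 bits with the low data bit set, and it assumes that a write to a halted register is silently ignored in the WARL style; both the encoding and that policy are assumptions, and the `htif` struct is hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Assumed encoding: dev(8) | cmd(8) | data(48), with halt as dev=0, cmd=0, (data & 1).
inline bool halted(uint64_t r) {
    const uint64_t devcmd = r >> 48;
    return devcmd == 0 && (r & 1) != 0;
}

struct htif {
    uint64_t tohost = 0;
};

// Guarded write: once tohost holds the halted combination, it stays there,
// no matter whether the write comes from the guest or from the host.
inline void write_tohost(htif &h, uint64_t value) {
    const uint64_t before = h.tohost;
    if (!halted(before)) {
        h.tohost = value;
    }
    // Invariant: a halted register is never modified.
    assert(!halted(before) || h.tohost == before);
}
```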

diegonehab · 2025-02-21T09:59:49Z

Hey @stskeeps, we are thinking of following this idea (or some refinement of it) on the release after the next one. Do you have reservations? @GCdePaula? @vfusco @mpolitzer

mpolitzer · 2025-02-24T13:46:16Z

What you described seems to be the current behavior from the guest POV on yield [1].
[1] https://github.com/cartesi/opensbi/blob/opensbi-1.3.1-ctsi-y/platform/cartesi/htif.c#L106

Is that so? Would there be any required changes in opensbi, linux, or libcmt?

It seems close to the current workflow on the host side. I don't see an issue there.

diegonehab · 2025-02-24T13:59:02Z

I think we can go further than what I suggested above. Right now, we have DEV/CMD/(REASON/DATA). This seems like overkill.

We should perhaps have CMD(16)/DATA(48) and be done with it? With new CMD getting the current DEV/CMD/REASON. Or some other simpler division.

Also remove the LSB requirement from HALT, which is also overkill. If you issue a HALT, you are halted. :)
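A rough sketch of what a CMD(16)/DATA(48) split could look like. The command names and numeric values below are invented for illustration and are not a proposal for the actual encoding.

```cpp
#include <cassert>
#include <cstdint>

constexpr int DATA_BITS = 48;
constexpr uint64_t DATA_MASK = (uint64_t{1} << DATA_BITS) - 1;

// Hypothetical flattened command set: one 16-bit CMD replaces DEV/CMD/REASON.
// Zero is left unused so that a cleared register does not decode as any command.
enum class htif_cmd : uint16_t {
    halt = 1,
    putchar = 2,
    yield_automatic = 3,
    yield_manual = 4,
};

inline uint64_t encode(htif_cmd cmd, uint64_t data) {
    assert(data <= DATA_MASK);
    return (static_cast<uint64_t>(cmd) << DATA_BITS) | data;
}

inline htif_cmd cmd_of(uint64_t r) {
    return static_cast<htif_cmd>(r >> DATA_BITS);
}

inline uint64_t data_of(uint64_t r) {
    return r & DATA_MASK;
}

// No LSB requirement: if the command is halt, the machine is halted.
inline bool halted(uint64_t r) {
    return cmd_of(r) == htif_cmd::halt;
}
```

With a nonzero halt command, an all-zero tohost cannot be confused with a halt, so the data field no longer has to carry that distinction.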

GCdePaula · 2025-02-24T14:40:15Z

I'm down for simplification, I don't mind updating our code. I'm trying to understand the changes.

Would this remove the possibility of querying "what was the latest break reason"? Like, it seems it's currently possible to look only at the current state of the machine and know what was the reason the machine broke last.

> We should perhaps have CMD(16)/DATA(48) and be done with it? With new CMD getting the current DEV/CMD/REASON. Or some other simpler division.

What would be the possible values for CMD in this proposal? In the case of tohost, would it be representable (abstractly) by something like this?

type ToHost =
    | Automatic { reason: AutomaticReason }
    | Manual { reason: ManualReason }

type AutomaticReason =
    | Progress { mille_progress: u32 }
    | TxOutput { data_size: u48 }
    | TxReport { data_size: u48 }

type ManualReason =
    | RxAccepted { data_size: u48 } // will always be 32?
    | RxRejected
    | TxException { data_size: u48 }
    | GIO { domain: u16, data_size: u48 }

And the fromhost would be something like:

type FromHost =
    | Advance { data_size: u48 }
    | Inspect { data_size: u48 }

Why do we need both tohost and fromhost? Can't we use the same register?

GCdePaula · 2025-02-24T15:12:47Z

Also, do we need both halt and yield?

diegonehab · 2025-02-24T15:16:47Z

The break reason would still be visible in tohost/mcycle.
There is a chance we can simplify this even further.
The important part is the external view, not the internal.
The difference between HALT and YIELD and even PUTCHAR is still the same: it's a different command that you are sending to HTIF. You don't want to mix "I am ok, send me more input" with "There is nothing you can do, I am done." or "Output some character."

diegonehab added the refactor Restructuring code, while not changing its original functionality label Jul 8, 2024

diegonehab added this to Machine Emulator SDK Jul 8, 2024

github-project-automation bot moved this to Todo in Machine Emulator SDK Jul 8, 2024

diegonehab mentioned this issue Jul 8, 2024

Split and rename iflags_Y, iflags_X, and iflags_H #52

Closed

6 tasks

diegonehab mentioned this issue Aug 7, 2024

refactor!: design new simplified C API #253

Closed

diegonehab changed the title ~~Simplify host control of machine~~ Simplify and generalize host control of machine Sep 27, 2024
