
Human In The Loop

starain-pactera edited this page Nov 16, 2020 · 9 revisions

(draft)

HUMAN IN THE LOOP (HITL)

Human in the loop (HITL) is a way to speed up the labeling process and improve model quality.

Labeling a large number of documents and analyzing many test files to improve model quality is a long process. HITL can help shorten it.

How does HITL work?

Scenario 1: Analyze a local file and then add it to the training set

  1. Train a model with 5 labeled documents.
  2. Pick a local file and analyze it. If any fields are incorrect, add the document to the training set and adjust the labels as needed.
  3. Train a new model with the updated labeling data.
  4. Repeat steps 2-3 until the model is good enough.
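
The loop in scenario 1 can be sketched in code. This is only an illustration of the workflow, not FoTT's implementation: `train`, `review`, and `refine_with_local_files` are hypothetical names, and labels are assumed to be a simple mapping from field name to value. The caller supplies `train` (builds a model from the labeled set) and `review` (the human step that returns the corrected labels for a file).

```python
from typing import Callable, Dict, List, Tuple

# Assumption: labels are a plain mapping of field name -> extracted value.
Labels = Dict[str, str]
# Assumption: a "model" is anything that maps a file path to predicted labels.
Model = Callable[[str], Labels]


def refine_with_local_files(
    train: Callable[[Dict[str, Labels]], Model],
    review: Callable[[str, Labels], Labels],
    training_set: Dict[str, Labels],
    local_files: List[str],
) -> Tuple[Model, Dict[str, Labels]]:
    """Hypothetical sketch of scenario 1: analyze, correct, retrain."""
    # Step 1: train an initial model on the documents labeled so far.
    model = train(training_set)

    for path in local_files:
        # Step 2: analyze a local file and let a human review the result.
        predicted = model(path)
        corrected = review(path, predicted)

        if corrected != predicted:
            # The prediction had incorrect fields: add the document,
            # with its adjusted labels, to the training set.
            training_set[path] = corrected
            # Step 3: train a new model with the updated labeling data.
            model = train(training_set)

    # Step 4 (repeat until good enough) is left to the caller, who can
    # call this function again with more local files.
    return model, training_set
```

Files whose predictions are already correct are not added, so the training set grows only with documents the current model gets wrong.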

Scenario 2: Auto-label the current document

  1. Upload your training documents to your Azure blob storage, then train a model with 5 labeled documents.
  2. Click auto-label. FoTT automatically picks up other files in the blob storage, analyzes them, and turns the analysis results into labels.
  3. Revise the labels of those files if necessary. Notice that the document's icon changes, indicating that it has been revised and is ready to be used for training.
  4. Start a new training session with the newly revised files.
  5. Repeat steps 2-4 until the model is good enough.
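
The document states in scenario 2 can be modeled as a small sketch. None of this is FoTT's actual data model: the `Status` values, the `Document` class, and the `auto_label`, `revise`, and `training_ready` helpers are hypothetical names chosen for illustration, and `analyze` stands in for whatever call returns the model's predictions for a file.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, Dict, List


class Status(Enum):
    # Assumed states; FoTT shows them as different document icons.
    UNLABELED = "unlabeled"
    AUTO_LABELED = "auto-labeled"
    REVISED = "revised"


@dataclass
class Document:
    name: str
    labels: Dict[str, str] = field(default_factory=dict)
    status: Status = Status.UNLABELED


def auto_label(
    docs: List[Document],
    analyze: Callable[[str], Dict[str, str]],
    batch_size: int,
) -> List[Document]:
    """Step 2: analyze unlabeled files and turn the results into labels."""
    picked = [d for d in docs if d.status is Status.UNLABELED][:batch_size]
    for doc in picked:
        doc.labels = dict(analyze(doc.name))
        doc.status = Status.AUTO_LABELED
    return picked


def revise(doc: Document, corrections: Dict[str, str]) -> None:
    """Step 3: a human corrects the labels; the document becomes trainable."""
    doc.labels.update(corrections)
    doc.status = Status.REVISED


def training_ready(docs: List[Document]) -> List[Document]:
    """Step 4: only revised documents feed the next training session."""
    return [d for d in docs if d.status is Status.REVISED]
```

Keeping auto-labeled and revised documents in separate states mirrors the icon change described in step 3: predictions that nobody has checked do not silently become training data.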