Skip to content

2024-12-09: Media description with Azure Content Understanding

Latest
Compare
Choose a tag to compare
@pamelafox pamelafox released this 09 Dec 18:45
· 2 commits to main since this release
0bb3f95

This release adds a new optional feature for data ingestion that sends each figure in a document to Azure Content Understanding to generate a description. This is an alternative approach to our vision approach for handling documents with images, though it could even be used in conjunction with it.

For example, for this document:
Screenshot 2024-12-09 at 10 48 44 AM

The graph is replaced by this description:
Screenshot 2024-12-09 at 10 50 52 AM

The description is stored as HTML in the chunk, for maximal expressivity:

  <figure>
   <figcaption>Impact of Interest Rates, Inflation, and GDP Growth on Financial Markets<br>
   The line graph titled 'On Financial Markets' displays trends in Interest Rates, Inflation Data, and GDP Growth from 2018 to 2023. The x-axis represents the years, while the y-axis shows percentage values ranging from -5 to 5. Interest Rates start at 2% in 2018, dip to 1% in 2020, and rise to 2% by 2023. Inflation Data begins at 2%, peaks at 3% in 2022, and slightly decreases to 2.5% in 2023. GDP Growth shows a significant drop to -4% in 2020, recovers to 3% in 2021, and stabilizes at 2% by 2023.
    <table>
      <thead>
        <tr>
          <td>Year</td>
          <td>Interest Rates %</td>
          <td>Inflation Data %</td>
          <td>GDP Growth %</td>
        </tr>
      </thead>
      <tbody>
        <tr>
          <td>2018</td>
          <td>2</td>
          <td>2</td>
          <td>2</td>
        </tr>
        <tr>
          <td>2019</td>
          <td>2</td>
          <td>2.5</td>
          <td>1</td>
        </tr>
        <tr>
          <td>2020</td>
          <td>1</td>
          <td>1.5</td>
          <td>-4</td>
        </tr>
        <tr>
          <td>2021</td>
          <td>1.5</td>
          <td>2</td>
          <td>3</td>
        </tr>
        <tr>
          <td>2022</td>
          <td>2</td>
          <td>3</td>
          <td>2</td>
        </tr>
        <tr>
          <td>2023</td>
          <td>2</td>
          <td>2.</td>
        </tr>
      </tbody>
    </table></figure>

See the doc for enabling the feature:
https://github.com/Azure-Samples/azure-search-openai-demo/blob/main/docs/deploy_features.md#enabling-media-description-with-azure-content-understanding

Please try it out and give feedback.

What's Changed

  • Add media description feature using Azure Content Understanding by @pamelafox in #2195

Full Changelog: 2024-12-02...2024-12-09