US Patent 11244111 Adaptive attention model for image captioning

Patent 11244111 was granted by the United States Patent and Trademark Office on February 8, 2022 and is assigned to Salesforce.com, Inc.

Is a: Patent

Patent attributes

Patent Applicant:
Current Assignee: Salesforce.com, Inc.
Patent Jurisdiction: United States Patent and Trademark Office
Patent Number: 11244111
Date of Patent: February 8, 2022
Patent Application Number: 16668333
Date Filed: October 30, 2019

Patent Citations

  • US Patent 10346721 Training a neural network using augmented training datasets
  • US Patent 10558750 Spatial attention model for image captioning
  • US Patent 10395118 Systems and methods for video paragraph captioning using hierarchical recurrent neural networks
  • US Patent 10565305 Adaptive attention model for image captioning
  • US Patent 10013640 Object recognition from videos using recurrent neural networks
  • US Patent 10032498 Memory cell unit and recurrent neural network including multiple memory cell units
  • US Patent 10133729 Semantically-relevant discovery of solutions
  • US Patent 10282663 Three-dimensional (3D) convolution with 3D batch normalization

Patent Citations Received

  • US Patent 11481563 Translating texts for videos based on video context

Patent Primary Examiner: Hadi Akhavannik

Patent abstract

The technology disclosed presents a novel spatial attention model that uses current hidden state information of a decoder long short-term memory (LSTM) to guide attention and to extract spatial image features for use in image captioning. The technology disclosed also presents a novel adaptive attention model for image captioning that mixes visual information from a convolutional neural network (CNN) and linguistic information from an LSTM. At each timestep, the adaptive attention model automatically decides how heavily to rely on the image, as opposed to the linguistic model, to emit the next caption word. The technology disclosed further adds a new auxiliary sentinel gate to an LSTM architecture and produces a sentinel LSTM (Sn-LSTM). The sentinel gate produces a visual sentinel at each timestep, which is an additional representation, derived from the LSTM's memory, of long and short term visual and linguistic information.
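
The mixing step described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the abstract gives no formulas, so the code assumes one plausible formulation in which the visual sentinel is a gated tanh of the LSTM memory cell, and a single softmax over the image-region scores plus one sentinel score decides how much weight goes to the image versus the sentinel. The function names (`visual_sentinel`, `adaptive_context`), the parameter names, and the scoring inputs are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def visual_sentinel(gate, memory):
    """Assumed form of the sentinel: elementwise gate * tanh(memory cell).

    gate:   sentinel gate activations in [0, 1], one per memory unit
    memory: LSTM memory cell values at the current timestep
    """
    return [g * math.tanh(c) for g, c in zip(gate, memory)]


def adaptive_context(region_features, region_scores, sentinel, sentinel_score):
    """Blend attended image features with the sentinel vector.

    region_features: list of k feature vectors, one per image region
    region_scores:   list of k unnormalised attention scores
    sentinel:        vector with the same length as each feature vector
    sentinel_score:  unnormalised attention score for the sentinel
    Returns (context, beta); beta is the weight placed on the sentinel,
    so 1 - beta is the total weight placed on the image regions.
    """
    # One softmax over k regions plus the sentinel (an assumption).
    weights = softmax(list(region_scores) + [sentinel_score])
    region_weights, beta = weights[:-1], weights[-1]
    context = [
        sum(w * feat[d] for w, feat in zip(region_weights, region_features))
        + beta * sentinel[d]
        for d in range(len(sentinel))
    ]
    return context, beta


# Example call with two image regions and made-up values.
features = [[1.0, 0.0], [0.0, 1.0]]
sentinel = visual_sentinel(gate=[0.9, 0.1], memory=[0.5, -0.5])
context, beta = adaptive_context(features, [2.0, 0.5], sentinel, 1.0)
```

In this sketch a large `sentinel_score` pushes `beta` toward 1, so the next word would depend mostly on the linguistic state held in the sentinel; a small one pushes `beta` toward 0, so the context is dominated by the image regions.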
