Patent attributes
In an embodiment, the disclosed technologies include identifying a content item of a first digital data source as a candidate for linking with a target entity of a second digital data source by matching a candidate entity mentioned in the content item to the target entity in accordance with semantic similarity data computed between the candidate entity and the target entity; inputting at least one feature of the content item and at least one feature of the target entity to a set of digital models that analyze the at least one feature of the content item and the at least one feature of the target entity and determine and output qualitative data; based on the qualitative data, determining link risk data; based on the link risk data and the semantic similarity data, and determining whether to generate a link between the content item and the target entity.