Rationale
The initial ranking based on similarities between the embeddings of the visual features of
the query and gallery images will contain a mixture of correct and incorrect matches. However, the
incorrect matches will not be ranked highly if ranked against correct, highly-
ranked gallery images. Since gallery-gallery ranks depend on comparing visual features within the
same domain, the problem is simplified and we are able to extract more semantically-meaningful
matches. This allows initially highly-ranked gallery images to be penalised and
moved down the query-gallery ranked list. Iterating this
process enables discovery of gallery images that are more
distantly connected to the query via multiple gallery-gallery
matches.
In some cases, although the query and gallery images are of the same class they may continue very
dissimilar visual features.
We provide a concrete example for a query sketch against gallery photos (though note that our
approach is not limited to sketch-photo retrieval). The query sketch
of some wooden doors (A) initially assigns a low rank to
the gallery glass doors (B), even though they both belong to
the same class. However, the gallery image of the church doors (C) contains similar visual features
to the sketch (e.g. curved door shape, wooden material). The church
doors contain some similarity to the glass doors (both are
within door frames) however the shape and materials are
different and so (B) is ranked relatively low in the gallery-
gallery ranks for (C). However, the gallery image of the
street house doors (D) are similar enough to (C) to be highly
ranked (both contain a wooden door within a door frame)
and, in turn, the house doors (D) are similar enough to the
glass doors (B) to be highly ranked in the gallery-gallery
ranks for (D) (rectangular door, within door frame). Our
iterative re-ranking process is able to discover such connections and raise the rank position of
difficult matches such as (B) against a very different query (A).
This approach emulates the cognitive mechanism employed by the human brain, leveraging thematic
semantic systems to establish meaningful connections between concepts.