speedup caption generator on im2txt (#6255)

* replace the usage of python's built in sort with numpy's argsort, which is faster. Also use the given axes to get the most_likely_words instead of calling enumerate and list (it's slower and more memory conssuming) * correct indentation and redaction on comment for the changes purposes according to cshallue's comment

speedup caption generator on im2txt (#6255)
* replace the usage of python's built in sort with numpy's argsort, which is faster. Also use the given axes to get the most_likely_words instead of calling enumerate and list (it's slower and more memory conssuming) * correct indentation and redaction on comment for the changes purposes according to cshallue's comment
cce6c09b · Rodrigo Laguna · Chris Shallue · e4a046e7 · cce6c09b
Commit cce6c09b authored Mar 06, 2019 by Rodrigo Laguna Committed by Chris Shallue Mar 06, 2019
Hide whitespace changes
Inline Side-by-side

Showing with 7 additions and 5 deletions

research/im2txt/im2txt/inference_utils/caption_generator.py research/im2txt/im2txt/inference_utils/caption_generator.py +7 -5

No files found.
--- a/research/im2txt/im2txt/inference_utils/caption_generator.py
+++ b/research/im2txt/im2txt/inference_utils/caption_generator.py
@@ -176,11 +176,13 @@ class CaptionGenerator(object):
        word_probabilities = softmax[i]
        state = new_states[i]
        # For this partial caption, get the beam_size most probable next words.
-        words_and_probs = list(enumerate(word_probabilities))
-        words_and_probs.sort(key=lambda x: -x[1])
-        words_and_probs = words_and_probs[0:self.beam_size]
-        # Each next word gives a new partial caption.
-        for w, p in words_and_probs:
+        # Sort the indexes with numpy, select the last self.beam_size
+        # (3 by default) (ie, the most likely) and then reverse the sorted
+        # indexes with [::-1] to sort them from higher to lower.
+        most_likely_words = np.argsort(word_probabilities)[:-self.beam_size][::-1]
+
+        for w in most_likely_words:
+          p = word_probabilities[w]
          if p < 1e-12:
            continue  # Avoid log(0).
          sentence = partial_caption.sentence + [w]