  1. 22 Oct, 2020 1 commit
  2. 18 Aug, 2020 1 commit
      Added TF-Lite-compatible feature extractor and model exporter for YAMNet (#9098) · 8da48573
      Manoj Plakal authored
      * Added TF-Lite-compatible feature extractor and model exporter for YAMNet.
      
      - Added a TF-Lite compatible feature extractor. With the latest TF-Lite,
        that involves a DFT-multiplication replacement for tf.abs(tf.signal.stft())
        and not a lot else. Note that TF-Lite now allows variable-length inputs.
      - Added a YAMNet exporter that produces TF2 SavedModels, TF-Lite models,
        and TF-JS models.
      - Cleanups: switched hyperparameters to a dataclass, got rid of
        some lingering cruft in yamnet_test.
      
      * Responded to Dan's comments in https://github.com/tensorflow/models/pull/9098
      
      - Switched some hparams to float
      - Made class map asset available on the exported model, and tested that
        it can be loaded from the various exports.
      8da48573
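The "DFT-multiplication replacement" mentioned in this commit can be illustrated with a minimal NumPy sketch. This is not the code from the commit: the function names, the periodic Hann window, and the frame sizes below are assumptions chosen for illustration. The idea is that the magnitude of a real DFT can be computed with two plain matrix multiplications (against cosine and sine bases), which avoids needing an FFT or complex-number op in the target runtime.

```python
import numpy as np


def frame_and_window(signal, frame_length, frame_step):
    """Slice a 1-D signal into overlapping frames and apply a Hann window.

    Hypothetical helper; a periodic Hann window is assumed here.
    """
    num_frames = 1 + (len(signal) - frame_length) // frame_step
    # Index matrix of shape (num_frames, frame_length).
    idx = (np.arange(frame_length)[None, :]
           + frame_step * np.arange(num_frames)[:, None])
    window = 0.5 - 0.5 * np.cos(
        2.0 * np.pi * np.arange(frame_length) / frame_length)
    return signal[idx] * window


def magnitude_dft_matmul(windowed_frames, fft_length):
    """Magnitude spectrum via real matrix multiplications only.

    Equivalent to abs(rfft(frame, n=fft_length)) for each frame, because
    zero-padding a frame does not change the sum over its nonzero samples.
    """
    frame_length = windowed_frames.shape[-1]
    n = np.arange(frame_length)[:, None]          # time index
    k = np.arange(fft_length // 2 + 1)[None, :]   # frequency-bin index
    angle = 2.0 * np.pi * n * k / fft_length
    real = windowed_frames @ np.cos(angle)        # real part of the DFT
    imag = windowed_frames @ -np.sin(angle)       # imaginary part of the DFT
    return np.sqrt(real ** 2 + imag ** 2)


# Usage with illustrative sizes (not YAMNet's actual settings).
rng = np.random.default_rng(0)
signal = rng.standard_normal(400)
windowed = frame_and_window(signal, frame_length=64, frame_step=16)
magnitudes = magnitude_dft_matmul(windowed, fft_length=128)
```

The trade-off is cost: the matrix products are O(N^2) per frame rather than O(N log N), which is usually acceptable for the short analysis windows typical of audio front ends.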
  3. 12 Aug, 2020 1 commit
      Input/Output tweaks for YAMNet and VGGish. (#9092) · 9b179e8e
      Manoj Plakal authored
      * Input/Output tweaks for YAMNet and VGGish.
      
      - Waveform input for YAMNet is now padded so that we get at least
        one patch of log mel spectrogram. The VGGish TF-Hub exporter
        uses YAMNet's feature computation so the VGGish export will
        also pad waveform input similarly.
      - Added a 1024-D embedding output to YAMNet so we now produce
        predicted scores, log mel spectrogram features, and embeddings,
        to satisfy a variety of uses: class prediction, acoustic
        feature visualization, semantic feature extraction.
      - Simplified usage of YAMNet in inference mode. Instead of trying
        to work around implicit batch size issues in the Model.predict()
        API, we simply __call__() the Model.
      - Switched inference.py to TF 2 and Eager execution.
      - Updated the visualization notebook: now uses TF2/Eager and
        can be loaded and run in Google Colab.
      
      * Responded to Dan's comments in https://github.com/tensorflow/models/pull/9092
      
      - Merged spectrogram computation and framing into a single function
        that returns both spectrogram and framed features.
      - Extended waveform padding to pad up to an integral number of hops
        in addition to the final STFT analysis window.
      9b179e8e
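The waveform padding described in this commit can be sketched as simple integer arithmetic. The following is one reading of the commit message, not the code from the commit: the function name and all sample counts are hypothetical, chosen so that the minimum length covers one patch plus the last STFT analysis window. Short inputs are padded up to that minimum, and longer inputs are padded so the remainder is a whole number of hops.

```python
def num_padding_samples(num_samples, min_samples, hop_samples):
    """Return how many zero samples to append to a waveform.

    min_samples: shortest length that yields one full patch (assumed to
                 already include the final STFT analysis window).
    hop_samples: hop between successive patches, in samples.
    All names and units here are illustrative assumptions.
    """
    # Step 1: pad short inputs up to the minimum length.
    pad = max(0, min_samples - num_samples)
    padded_length = num_samples + pad

    # Step 2: round whatever follows the first patch up to whole hops.
    after_first_patch = padded_length - min_samples
    num_hops = -(-after_first_patch // hop_samples)  # ceiling division
    pad += num_hops * hop_samples - after_first_patch
    return pad


# Illustrative values, in samples: a 15360-sample patch, a 400-sample
# STFT window with a 160-sample STFT hop, and a 7680-sample patch hop.
MIN_SAMPLES = 15360 + 400 - 160   # 15600
HOP_SAMPLES = 7680

short_pad = num_padding_samples(1000, MIN_SAMPLES, HOP_SAMPLES)
exact_pad = num_padding_samples(MIN_SAMPLES, MIN_SAMPLES, HOP_SAMPLES)
over_pad = num_padding_samples(MIN_SAMPLES + 1, MIN_SAMPLES, HOP_SAMPLES)
```

With these values, a 1000-sample input is padded to exactly one patch, an input of exactly the minimum length needs no padding, and one extra sample triggers padding up to the next full hop.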
  4. 09 Aug, 2020 1 commit
  5. 17 Apr, 2020 1 commit
  6. 26 Feb, 2020 1 commit
  7. 22 Feb, 2020 1 commit
  8. 20 Feb, 2020 1 commit
  9. 18 Jan, 2020 1 commit
      Cleaned up dependences and install instructions for vggish and yamnet. (#8059) · 831281ce
      Manoj Plakal authored
      - Made code work with either TF v1.x or TF v2.x, while explicitly
        enabling v1.x behavior.
      - Pulled slim from tf_slim package instead of through tensorflow
        contrib. Note that tf_slim itself uses tensorflow contrib so
        it requires using TF v1.x for now (referenced a relevant PR
        which should remove this limitation once it gets merged).
      - Removed all mention of scipy. Switched wav writing to soundfile.
      - Switched package name to soundfile instead of pysoundfile. The
        former is the newer name.
      - Updated installation instructions for both vggish and yamnet to
        reflect these changes.
      - Tested new installation procedures. vggish works with TF v1.15,
        yamnet works with TF v1.15.0 as well as TF v2.1.0.
      831281ce
  10. 14 Jan, 2020 1 commit
  11. 21 Nov, 2019 1 commit