Commits · 2d2582fca7b46091f324f005c8f9f439a829b45f · ModelZoo / ResNet50_tensorflow

22 Oct, 2020 1 commit
- Updated YAMNet visualization notebook to match latest model code. (#9406) · 2d2582fc
  Manoj Plakal authored Oct 22, 2020
  
  2d2582fc
18 Aug, 2020 1 commit

Added TF-Lite-compatible feature extractor and model exporter for YAMNet (#9098) · 8da48573

Manoj Plakal authored Aug 17, 2020

* Added TF-Lite-compatible feature extractor and model exporter for YAMNet.

- Added a TF-Lite compatible feature extractor. With the latest TF-Lite,
  that involves a DFT-multiplication replacement for tf.abs(tf.signal.stft())
  and not a lot else. Note that TF-Lite now allows variable-length inputs.
- Added a YAMNet exporter that produces TF2 SavedModels, TF-Lite models,
  and TF-JS models.
- Cleanups: switched hyperparameters to a dataclass, got rid of
  some lingering cruft in yamnet_test.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9098

- Switched some hparams to float
- Made class map asset available on the exported model, and tested that
  it can be loaded from the various exports.

8da48573

12 Aug, 2020 1 commit

Input/Output tweaks for YAMNet and VGGish. (#9092) · 9b179e8e

Manoj Plakal authored Aug 12, 2020

* Input/Output tweaks for YAMNet and VGGish.

- Waveform input for YAMNet is now padded so that we get at least
  one patch of log mel spectrogram. The VGGish TF-Hub exporter
  uses YAMNet's feature computation so the VGGish export will
  also pad waveform input similarly.
- Added a 1024-D embedding output to YAMNet so we now produce
  predicted scores, log mel spectrogram features, and embeddings,
  to satisfy a variety of uses: class prediction, acoustic
  feature visualization, semantic feature extraction.
- Simplified usage of YAMNet in inference mode. Instead of trying
  to work around implicit batch size issues in the Model.predict()
  API, we simply __call__() the Model.
- Switched inference.py to TF 2 and Eager execution.
- Updated the visualization notebook: now uses TF2/Eager and
  can be loaded and run in Google Colab.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9092

- Merged spectrogram computation and framing into a single function
  that returns both spectrogram and framed features.
- Extended waveform padding to pad up to an integral number of hops
  in addition to the final STFT analysis window.

9b179e8e

09 Aug, 2020 1 commit

Made VGGish and YAMNet work in TF2 without disabling TF2 behavior. (#9077) · 557eec27

Manoj Plakal authored Aug 09, 2020

* Made VGGish and YAMNet work in TF2 without disabling TF2 behavior.

Allowed TF2 behavior and allowed passing in a features tensor into the
VGGish model definition. Both of these changes are needed for making
TF-Hub exports of these models. Lifted constraints on TF versions since
tf_slim has been updated to work with TF 2.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9077

* Fixed typo in comment.

557eec27

17 Apr, 2020 1 commit
- Remove stray period in yamnet README.md. (#8269) · a29fcd96
  Dan Ellis authored Apr 17, 2020
  
  a29fcd96
26 Feb, 2020 1 commit
- Add the tf2-compatible graph wrapping idiom to inference.py. (#8203) · 83f56818
  Dan Ellis authored Feb 26, 2020
  
  83f56818
22 Feb, 2020 1 commit

YAMNet variable sr (#8174) · 7b1553fd

Dan Ellis authored Feb 21, 2020


* Modify yamnet_visualization to work for any (?) sampling rate in input wav file.

7b1553fd

20 Feb, 2020 1 commit
- Update yamnet_visualization.ipynb to work under TF2.0. (#8161) · 1f61912a
  Dan Ellis authored Feb 19, 2020
  
  1f61912a
18 Jan, 2020 1 commit

Cleaned up dependences and install instructions for vggish and yamnet. (#8059) · 831281ce

Manoj Plakal authored Jan 18, 2020

- Made code work with either TF v1.x or TF v2.x, while explicitly
  enabling v1.x behavior.l
- Pulled slim from tf_slim package instead of through tensorflow
  contrib. Note that tf_slim itself uses tensorflow contrib so
  it requires using TF v1.x for now (referenced a relevant PR
  which should remove this limitation once it gets merged).
- Removed all mention of scipy. Switched wav writing to soundfile.
- Switched package name to soundfile instead of pysoundfile. The
  former is the newer name.
- Updated installation instructions for both vggish and yamnet to
  reflect these changes.
- Tested new installation procedures. vggish works with TF v1.15,
  yamnet works with TF v1.15.0 as well as TF v2.1.0.

831281ce

14 Jan, 2020 1 commit
- Fix soundfile import in YAMNet README. (#8040) · fae2b55c
  Dan Ellis authored Jan 13, 2020
  
  fae2b55c
21 Nov, 2019 1 commit
- Adding research/audioset/yamnet, a pre-trained audio event classifier (#7850) · dfffd623
  Dan Ellis authored Nov 21, 2019
```
Add files for YamNet, a sound event classifier.
```
  dfffd623