Commits · 09bc9f54fb7084b7908447572938b2e203d7c232 · ModelZoo / ResNet50_tensorflow

08 Sep, 2021 1 commit
- Explicit signatures for tflite. Using ideas from #9688 (#10248) · b66b0b05
  Dan Ellis authored Sep 08, 2021
  
  b66b0b05
08 Jun, 2021 1 commit

Update link to Embedding Colab (#10048) · 19738a07

Dan Ellis authored Jun 08, 2021

The original Colab by malcolmslaney didn't work with the current VGGish/tensorflow.  Changed the link to an updated version.

19738a07

22 Oct, 2020 1 commit
- Updated YAMNet visualization notebook to match latest model code. (#9406) · 2d2582fc
  Manoj Plakal authored Oct 22, 2020
  
  2d2582fc
18 Aug, 2020 1 commit

Added TF-Lite-compatible feature extractor and model exporter for YAMNet (#9098) · 8da48573

Manoj Plakal authored Aug 17, 2020

* Added TF-Lite-compatible feature extractor and model exporter for YAMNet.

- Added a TF-Lite compatible feature extractor. With the latest TF-Lite,
  that involves a DFT-multiplication replacement for tf.abs(tf.signal.stft())
  and not a lot else. Note that TF-Lite now allows variable-length inputs.
- Added a YAMNet exporter that produces TF2 SavedModels, TF-Lite models,
  and TF-JS models.
- Cleanups: switched hyperparameters to a dataclass, got rid of
  some lingering cruft in yamnet_test.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9098

- Switched some hparams to float
- Made class map asset available on the exported model, and tested that
  it can be loaded from the various exports.

8da48573

12 Aug, 2020 1 commit

Input/Output tweaks for YAMNet and VGGish. (#9092) · 9b179e8e

Manoj Plakal authored Aug 12, 2020

* Input/Output tweaks for YAMNet and VGGish.

- Waveform input for YAMNet is now padded so that we get at least
  one patch of log mel spectrogram. The VGGish TF-Hub exporter
  uses YAMNet's feature computation so the VGGish export will
  also pad waveform input similarly.
- Added a 1024-D embedding output to YAMNet so we now produce
  predicted scores, log mel spectrogram features, and embeddings,
  to satisfy a variety of uses: class prediction, acoustic
  feature visualization, semantic feature extraction.
- Simplified usage of YAMNet in inference mode. Instead of trying
  to work around implicit batch size issues in the Model.predict()
  API, we simply __call__() the Model.
- Switched inference.py to TF 2 and Eager execution.
- Updated the visualization notebook: now uses TF2/Eager and
  can be loaded and run in Google Colab.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9092

- Merged spectrogram computation and framing into a single function
  that returns both spectrogram and framed features.
- Extended waveform padding to pad up to an integral number of hops
  in addition to the final STFT analysis window.

9b179e8e

10 Aug, 2020 2 commits

Added a TF-Hub-compatible SavedModel exporter for VGGish. (#9081) · f7c1734a

Manoj Plakal authored Aug 10, 2020

* Added a TF-Hub-compatible SavedModel exporter for VGGish.

* Responded to DAn's comments, switched linspace to arange everywhere.

f7c1734a

VGGish embeddings are pre-activation, not post-activation. (#9080) · 0f7616bd

Manoj Plakal authored Aug 10, 2020

Fixed a long-standing bug where the released VGGish model used
post-activation embedding output while the released embeddings
were pre-activation. There are still discrepancies due to
other reasons: differences in choice of YouTube transcode,
repeated resamplings with different resamplers, slight differences
in feature computation, etc.

0f7616bd

09 Aug, 2020 1 commit

Made VGGish and YAMNet work in TF2 without disabling TF2 behavior. (#9077) · 557eec27

Manoj Plakal authored Aug 09, 2020

* Made VGGish and YAMNet work in TF2 without disabling TF2 behavior.

Allowed TF2 behavior and allowed passing in a features tensor into the
VGGish model definition. Both of these changes are needed for making
TF-Hub exports of these models. Lifted constraints on TF versions since
tf_slim has been updated to work with TF 2.

* Responded to DAn's comments in https://github.com/tensorflow/models/pull/9077

* Fixed typo in comment.

557eec27

17 Apr, 2020 1 commit
- Remove stray period in yamnet README.md. (#8269) · a29fcd96
  Dan Ellis authored Apr 17, 2020
  
  a29fcd96
13 Apr, 2020 1 commit

Updated README files of research models (#8390) · d466d4e6

Jaeyoun Kim authored Apr 12, 2020

* Update README.md

No Maintenance Intended

* Update README.md

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Create README.md

No Maintenance Intended

* Update README.md

No Maintenance Intended

* Update README.md...

d466d4e6

26 Feb, 2020 1 commit
- Add the tf2-compatible graph wrapping idiom to inference.py. (#8203) · 83f56818
  Dan Ellis authored Feb 26, 2020
  
  83f56818
22 Feb, 2020 1 commit

YAMNet variable sr (#8174) · 7b1553fd

Dan Ellis authored Feb 21, 2020


* Modify yamnet_visualization to work for any (?) sampling rate in input wav file.

7b1553fd

20 Feb, 2020 1 commit
- Update yamnet_visualization.ipynb to work under TF2.0. (#8161) · 1f61912a
  Dan Ellis authored Feb 19, 2020
  
  1f61912a
18 Jan, 2020 1 commit

Cleaned up dependences and install instructions for vggish and yamnet. (#8059) · 831281ce

Manoj Plakal authored Jan 18, 2020

- Made code work with either TF v1.x or TF v2.x, while explicitly
  enabling v1.x behavior.l
- Pulled slim from tf_slim package instead of through tensorflow
  contrib. Note that tf_slim itself uses tensorflow contrib so
  it requires using TF v1.x for now (referenced a relevant PR
  which should remove this limitation once it gets merged).
- Removed all mention of scipy. Switched wav writing to soundfile.
- Switched package name to soundfile instead of pysoundfile. The
  former is the newer name.
- Updated installation instructions for both vggish and yamnet to
  reflect these changes.
- Tested new installation procedures. vggish works with TF v1.15,
  yamnet works with TF v1.15.0 as well as TF v2.1.0.

831281ce

14 Jan, 2020 1 commit
- Fix soundfile import in YAMNet README. (#8040) · fae2b55c
  Dan Ellis authored Jan 13, 2020
  
  fae2b55c
13 Jan, 2020 3 commits
- Tolerate missing soundfile module. (#8039) · 39d55d9a
  Dan Ellis authored Jan 13, 2020
```
* Update slim include.

* Force installation of tensorflow 1.14 (not tf2).

* Code will run without soundfile (but die if you attempt to read a sound file).
```
  39d55d9a
- Update slim include. (#8036) · e9233425
  Dan Ellis authored Jan 13, 2020
```
* Update slim include.

* Force installation of tensorflow 1.14 (not tf2).
```
  e9233425
- Update VGGish README for new path, soundfile (#8038) · 8b479cc7
  Dan Ellis authored Jan 13, 2020
  
  8b479cc7
21 Nov, 2019 1 commit
- Adding research/audioset/yamnet, a pre-trained audio event classifier (#7850) · dfffd623
  Dan Ellis authored Nov 21, 2019
```
Add files for YamNet, a sound event classifier.
```
  dfffd623
13 Jun, 2019 1 commit
- Moved research/audioset VGGish code into its own subdirectory (#7009) · 4079c5d9
  Manoj Plakal authored Jun 13, 2019
```
* Moved VGGish code into its own directory.

* Moved most of old README.md into vggish/README.md.
```
  4079c5d9
12 Nov, 2018 1 commit
- Shortened information about used library versions in Audioset README · de51e746
  Samuel Neugber authored Nov 12, 2018
  
  de51e746
08 Nov, 2018 1 commit
- Switching to more robust pysoundfile for reading wav files · dec81ac7
  Samuel Neugber authored Nov 07, 2018
  
  dec81ac7
05 Nov, 2018 1 commit
- Revert "Use explicit relative import syntax for python 3 compatibility in audioset." · 7d2da5cb
  David T.H. Kao authored Nov 05, 2018
  
  7d2da5cb
04 Nov, 2018 1 commit
- Use explicit relative import syntax for python 3 compatibility. · f54d201d
  David Kao authored Nov 03, 2018
  
  f54d201d
20 Sep, 2018 1 commit
- Fixed typo in --train_vggish flag description. · 58877553
  Souradip Mookerjee authored Sep 02, 2018
  
  58877553
15 Aug, 2018 1 commit

Add a pointer to a Colab for the embeddings · 60d6808b

Malcolm Slaney authored Aug 14, 2018

Added a pointer to a Colab that illustrates how to use the public AudioSet models to generate embeddings for user-specified sounds.

60d6808b

13 Feb, 2018 3 commits
- Cosmetic changes to spectrogram_to_mel_matrix() · bdf0f628
  Manoj Plakal authored Feb 13, 2018
```
To simplify syncing this c ode back into Google.
```
  bdf0f628
- Fix double paste. · 982a5504
  Manoj Plakal authored Feb 13, 2018
  
  982a5504
- Add sanity checks to mel computation. · 4b411030
  Manoj Plakal authored Feb 13, 2018
```
Clamp the requested frequency range to [0, Nyquist].
```
  4b411030
09 Feb, 2018 1 commit

Explicitly specify checkpoint version. · d9f6b6f3

Manoj Plakal authored Feb 09, 2018


TF Saver now requires specifying the checkpoint version even when restoring to avoid errors when we specify
a path to a checkpoint file instead of a directory.

d9f6b6f3

15 Oct, 2017 1 commit
- Fixing wrong path · d72df8b0
  Quentin Pleplé authored Oct 15, 2017
  
  d72df8b0
12 Oct, 2017 1 commit
- Concatenate instead of add in vggish_train_demo.py. · a7c84b82
  Manoj Plakal authored Oct 12, 2017
```
Also added 'sudo' to installation instructions.
```
  a7c84b82
23 Sep, 2017 1 commit
- Fix broken links in the models repo (#2445) · 4a705e08
  Neal Wu authored Sep 22, 2017
  
  4a705e08
21 Sep, 2017 1 commit
- Move the research models into a research subfolder (#2430) · f87a58cd
  Neal Wu authored Sep 21, 2017
  
  f87a58cd