Replies: 5 comments 2 replies
-
Interesting! What's the new model type (and what's the intuition behind why it is better)? A couple thoughts:
|
Beta Was this translation helpful? Give feedback.
-
I continued to explore the CPP by compiling a list of resources and datasets, and I drafted an initial writeup describing the problem and the plan of attack. I’m not sure what the best way to share this document is since I can’t seem to find a way to upload it to the ‘Projects’ section on GitHub. As I dove into the literature, I found an interesting paper that implemented wavelet scattering transforms, which yield translation-invariant time-frequency representations of input data. I experimented with this technique and ended up with ~99% accuracy on the macaque dataset using a CNN-based architecture. It might be worth adding this to the representation toolbox, especially since it seems that a CNN-based approach might be well-suited for this representation as opposed to a usual spectrogram representation. |
Beta Was this translation helpful? Give feedback.
-
I dove deeper into the literature (which is very extensive!!) with a focus on both biology- and computer science-related publications. I think this is an interesting approach, since a number of papers have been exploring the possibility of constructing biology/physiology-inspired computational models to address the CPP. Thus, it seems to make sense to understand both the biology of the problem and the computer science of the problem at a fundamental level. As Britt suggested during our call wrt the timeline of research, I'm thinking it might be worthwhile to do some sort of visualization to show the existing body of research. However, before getting to this, I think it might be reasonable to continuing parsing the literature for inspiration and selecting a handful of papers to explore more deeply - I'm planning to add something like a bookshelf to the CPP project/repo. As I touched on during our call, I've formulated a draft roadmap for addressing the CPP, so I'll make sure to push this to the appropriate repo before our next sync! |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
I've started to explore the Asteroid package. It's actually super cool and very comprehensive - the only thing is the recipes take a very long time to run, but I think that understanding the implementation of the techniques included will provide a ton of insight into how best to address the CPP in bioacoustics applications. |
Beta Was this translation helpful? Give feedback.
-
Firstly, I experimented with the dolphin stranding inference runs, and given that we've been working on this problem for less than 1 week, I suppose I am cautiously optimistic. But I'm also wondering if we planning to continue with this project (?).
Also, I continued to explore the concept of learned transforms (time-frequency representations), but after experimenting with a number of techniques in an attempt to improve the results, I decided to explore more deeply the fundamentals of the neural network architectures used in acoustic ML problems. Over the past few weeks, I've been considering the problem of how to construct an optimal neural network model, and today--after reading some intro resources on the cocktail party problem (CPP)--I experimented with a new type of model with surprisingly promising results (for instance, the simple CNN-based toy model achieved 96.9% accuracy on the macaque dataset, but the new model achieved 98.9%). I have a few ideas in mind for modifying the architecture in the hopes that it could help to address a number of general bioacoustics research problems in addition to the CPP-related problems of sequential and simultaneous integration (and segregation) of acoustic signals.
Beta Was this translation helpful? Give feedback.
All reactions