softmax kernel #106
Conversation
@SimonDanisch @MikeInnes What was the conclusion regarding this PR?
I have a working, reasonably fast, but not very generic CUDA softmax in https://github.com/jekbradbury/Transformer.jl/blob/master/src/kernels.jl
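For readers skimming the thread, the computation such a kernel fuses is just the numerically stable softmax below. This is a minimal, unfused sketch in plain broadcast Julia (the `softmax` name and the columns-as-samples layout are our assumptions, not the kernel from kernels.jl; `dims=` is the 0.7/1.0 reduction syntax); since it only uses broadcast and dims-reductions, it should already run with both CuArrays and CLArrays:

```julia
# Minimal, unfused softmax sketch (an illustration, not the fused kernel
# from kernels.jl). Works columnwise: each column is one sample.
function softmax(xs::AbstractMatrix)
    m = maximum(xs, dims=1)   # per-column max, for numerical stability
    e = exp.(xs .- m)         # shifted exponentials
    e ./ sum(e, dims=1)       # normalize each column
end
```

The point of a hand-written kernel is to fuse these three passes (max, exp, sum) into one, keeping the two reductions in shared memory instead of round-tripping through global memory.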
Yeah, looks like it's relatively CUDA-specific. I wonder if it would be easier to port James's kernel to OpenCL than to write the OpenCL softmax kernel from scratch.
We can just port it to Julia in a way that works with both CLArrays and CuArrays. I already took a look at it - the only thing holding us back is that @jekbradbury used dynamic shared memory, which behaves a bit peculiarly compared to CuStaticSharedMem (which is also supported by CLArrays when you use the GPUArrays version). I had a stab at supporting dynamic shared memory in GPUArrays vendor-independently, but couldn't implement it in the time frame I set myself... In theory it's quite straightforward, and I should make a PR out of what I had ;)
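For context on the static/dynamic distinction being discussed: static shared memory is declared with a compile-time constant size, while dynamic shared memory gets its size at launch time. A minimal sketch, assuming CUDAnative's macro names of that era (`@cuStaticSharedMem` / `@cuDynamicSharedMem`), a fixed 256-thread block, and illustrative kernel names; exact launch syntax varies across CUDAnative versions:

```julia
using CUDAnative, CuArrays

# Static shared memory: the length is a compile-time constant, which is
# what a vendor-neutral abstraction (GPUArrays' equivalent of
# CuStaticSharedMem) can lower for both CUDA and OpenCL.
function blocksum_static!(out, xs)
    buf = @cuStaticSharedMem(Float32, 256)  # size fixed at compile time
    i = threadIdx().x
    buf[i] = xs[i]
    sync_threads()
    # tree reduction within the block
    stride = 128
    while stride >= 1
        if i <= stride
            buf[i] += buf[i + stride]
        end
        sync_threads()
        stride ÷= 2
    end
    i == 1 && (out[1] = buf[1])
    return
end

# Dynamic shared memory: the size is only supplied at launch time, which
# is the part GPUArrays did not yet abstract over vendor-independently.
function blocksum_dynamic!(out, xs)
    buf = @cuDynamicSharedMem(Float32, blockDim().x)
    i = threadIdx().x
    buf[i] = xs[i]
    sync_threads()
    stride = blockDim().x ÷ 2
    while stride >= 1
        if i <= stride
            buf[i] += buf[i + stride]
        end
        sync_threads()
        stride ÷= 2
    end
    i == 1 && (out[1] = buf[1])
    return
end

# Illustrative launches (keyword syntax from later CUDAnative releases):
# @cuda threads=256 blocksum_static!(out, xs)
# @cuda threads=256 shmem=256 * sizeof(Float32) blocksum_dynamic!(out, xs)
```

Note that the dynamic variant needs the extra `shmem` byte count threaded through the launch, which is exactly the piece a backend-agnostic `gpu_call` would have to expose.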
I don't know if there's any particular reason Marian-NMT used dynamic shared memory for this rather than static. (Also, this kernel contains a reasonably fast mapreducedim implementation for reductions over the inner dim, so it would be useful to include that separately if someone works on porting.)
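For anyone unfamiliar with the primitive being referenced: an inner-dim `mapreducedim` applies a function elementwise and reduces along the first (fastest-varying) dimension, which is exactly the shape of the two reductions a columnwise softmax needs (a max and then a sum per column). A CPU-side sketch with made-up sizes, showing the 0.6-era `mapreducedim` spelling used in the comment and its 1.0 equivalent:

```julia
xs = rand(Float32, 512, 64)   # 512 features × 64 samples (made-up sizes)

# Julia 0.6 spelling, as referenced above:
#   colmax = mapreducedim(identity, max, xs, 1)
# Julia 1.0 equivalent:
colmax = mapreduce(identity, max, xs, dims=1)   # size (1, 64): per-column max
```

A softmax kernel performs two of these per column, so a fast inner-dim reduction is worth extracting on its own, as suggested above.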
Force-pushed from 39e7783 to fef2421.