Okay anons anyone done any GPU programming here with CUDA? Is it nice?
You are currently reading a thread in /g/ - Technology

Thread replies: 7
Thread images: 2
File: 35_logo.jpg (94 KB, 465x515)
Okay anons, has anyone here done any GPU programming with CUDA? Is it nice?
>>
>>53905479
CUDA isn't bad. I use it for machine learning research through various neural network frameworks that support it. As long as you know C++ you should be fine, and you should be able to pick it up after reading up on it.
>>
>>53905479
Not CUDA, but I've done programming with OpenGL compute shaders.

Nvidia's Compute Shader compiler seems to suck ass though, and all my attempts to use it to speed up my calculations so far have resulted in slower code.

So meh.
>>
>>53905886
>try to speed up calculations
>get slower code
Nvidia sure likes to do this in every field, it seems.
>>
>>53907742
It does real stupid things sometimes, like:

>Accessing an NxN 2D array is slow as fuck, but accessing an N*N 1D array (using x*N+y) is fast.

>Accessing shmem arrays with local ID dependent lookups is slow as fuck, but accessing same-sized textures with local ID dependent integer lookups is fast.

The compiler just fucking sucks. I want to try doing it with Vulkan to see if I can't write my own better optimized assembly.
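
For anyone unfamiliar with the x*N+y trick mentioned above, here is a minimal sketch of the index arithmetic in plain host-side C++. The names flat_index and diagonal_sum are made up for illustration and are not from any library. It only shows how a 2D coordinate maps onto one contiguous N*N buffer; it does not demonstrate or measure the speed difference described in this post, and the same arithmetic would have to be written inside the shader or kernel to matter on a GPU.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: maps a 2D coordinate (x, y) in an N x N grid
// to an offset in a flat buffer of N * N elements, using x * N + y.
inline std::size_t flat_index(std::size_t x, std::size_t y, std::size_t n) {
    assert(x < n && y < n);  // both coordinates must lie inside the grid
    return x * n + y;
}

// Hypothetical example: sums the main diagonal of an N x N grid that is
// stored as one contiguous buffer instead of an array of arrays.
inline float diagonal_sum(const std::vector<float>& grid, std::size_t n) {
    assert(grid.size() == n * n);  // buffer must hold exactly N * N values
    float sum = 0.0f;
    for (std::size_t i = 0; i < n; ++i) {
        sum += grid[flat_index(i, i, n)];
    }
    return sum;
}
```

With n = 4, flat_index(1, 2, 4) gives 6, since 1*4+2 = 6.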
>>
File: w.png (224 KB, 500x330)
>>53905479
For deep learning it's pretty rad, but for sheer number crunching it sucks ass compared to cheaper AMD offerings.

>>53905886
>resulted in slower code
Me 2
>>
>>53907962
Going by the problems you describe, it seems like the compiler's linker method is fucked up.
This is a 4chan archive - all of the content originated from them. If you need IP information for a Poster - you need to contact them. This website shows only archived content.