Qestions of ROCm and the relation between it and HSA
Hello all,After doing some research online and reading the introduction of ROCm. I'm confused on what HSA (AMD HSA Runtime) and ROCm does. They seems to both provide the same thing(HSA APIs) and that...
View ArticleExplanation of terms from chapter 9 of GCN ISA manual
Hello, I am not allowed to post in the HSA section (or is there a GCN/ROCm section?), hopefully someone knowledgeable will find the question here. Chapter 9 of GCN ISA Manual (the version from August...
View ArticleROCm documentation feedback
In case it is helpful to ROCm doc editors, attached are some questions, formatting problems, miswordings, and typos/misspellngs I encountered while reading the first pages (Welcome, Release Notes,...
View ArticleHSA - Sample code - Creating the queue failed
Hello. I am getting my machine setup with HSA for a project. I managed to follow the instruction installation steps at the HSA's Github page. I am now trying to run the sample program vector_copy. It...
View ArticleGood way to share carry flag cross lane?
I need to add two ulong.The first half (low 32 bits) of each number is stored in lanes with even id, and the second half (high 32 bits) is stored in the lane next to it (next odd id).So one lane adds...
View ArticleGet element from an array in a paticular lane
Say I have 8 threads, I want to get array tsum[laneID%8] from lane 0/8 for thread of laneID.One possible way is to use 16 move dpp instructions with row shift, like this:__asm ( \ "s_nop 1\n" \...
View ArticleGPUOpen Anaconda links broken
On the pagehttps://gpuopen.com/compute-product/continuum-analytics-anaconda-with-numba/ The document link is brokenhttp://numba.pydata.org/numba-doc/latest/hsa/index.html (404 Page not found) Maybe it...
View Article"Radeon Open Compute Platform (ROCm)" link is broken
On the community HSA overview page, the "Radeon Open Compute Platform (ROCm)" link is broken.It leads tohttps://radeonopencompute.github.io/ (404 Site not found) but maybe it should lead...
View ArticleTensorflow with AMD GPU
I just bought a new Desktop with Ryzen 5 CPU and an AMD GPU to learn GPU programming. I am also interested in learning Tensorflow for deep neural networks. After a few days of fiddling with tensorflow...
View ArticleROCm readthedocs column width, table wrapping
The ROCm documentation on the readthedocs io site is written in reStructuredText (.rst).It is compiled and uses the readthedocs sitewide theme.css. The theme.css is designed to be responsive to varying...
View Article