Newest 'cuda' Questions - Computer Graphics Stack Exchange

2 votes

1 answer

68 views

How is Shader Execution Reordering implemented under the hood?

I am writing a software path tracer from scratch in CUDA (for learning purposes, without resorting to any higher-level graphics API like OptiX), and it is well-known that path tracing can have ...

Enigmatisms

1,112

asked Jun 11 at 2:52

1 vote

1 answer

94 views

Why is my ray tracer not accelerated by CUDA?

Summary: I ported a simple path tracer smallpt into CUDA but the performance doesn't boost. reproduce results: build and run the CPU version: ...

Rahn

183

asked Apr 12 at 3:18

0 votes

0 answers

74 views

What performance to expect from an optimised pathtracer?

I am currently writing a fairly basic pathtracer as a compute shader in C++/CUDA, which has been running on a GTX 1080, and I have noticed that when logging some statistics, I get around ~1 billion ...

Jack Freeth

1

asked Jan 5 at 12:09

0 votes

1 answer

95 views

What is in a GPU driver?

There are 3 kinds of code that are contained in a GPU-utilizing program: CPU code. CPU-to-GPU or GPU-to-CPU bridge code. This is what DirectX actually is. GPU code. In NVidia speak, this is called a &...

Meatball Princess

103

asked May 27, 2023 at 17:12

1 vote

1 answer

944 views

On modern GPUs, what hardware do vertex/pixel shaders have access to that compute shaders don't?

Suppose that someone implemented rendering for a video game using entirely GPGPU apis, like compute shaders or CUDA. Would it be possible to achieve the same performance as standard graphics pipelines ...

while1fork

113

asked Apr 10, 2022 at 16:18

2 votes

1 answer

6k views

Performance difference in OpenGL compute shader vs Vulkan Compute Shader vs Cuda/OpenCL and others

I have been told there is no performance difference if you are skilled enough when it comes to gpu computing. And cuda only performs better because nvidia wants it to. I have also read graphic api's ...

VIVEK Yadav

23

asked Mar 15, 2021 at 6:34

1 vote

1 answer

196 views

Why is the image from the CUDA version not as sharp as the CPU version?

I am extending Roger Allen's Ray Tracing in One Weekend in CUDA (and also here) by following Peter Shirley's Ray Tracing: The Next Week; specifically Image Texture Mapping. The image I obtain from ...

Vectorizer

299

asked Nov 22, 2020 at 8:49

3 votes

1 answer

207 views

Why does my image has a square pattern in cuda path tracer?

I am writting a path tracer in cuda following the Shirley's book. I am having a square pattern as an output. The size of the block change if I change the block dimensions. For this image, the block ...

Kaan E.

461

asked Sep 20, 2020 at 18:14

2 votes

0 answers

1k views

Groupshared memory and parallel reduction over multiple kernel dispatch

I am doing some parallel reduction and mostly following these nVidia slides. However they are not very detailed in places or I might be missing/misunderstanding something. Edit 2: While I figured ...

AverageGatsby

71

asked May 24, 2020 at 18:49

1 vote

0 answers

788 views

How to display image from CUDA / What API to port my project to for real-time raytracing?

I started a raytracing project using the python numba library which provides a just-in-time compiler for CUDA kernels. When the scene is rendered my result is a 1920 x 1080 x 3 RGB array in GPU memory....

Speterius

71

asked Aug 19, 2019 at 18:55

2 votes

1 answer

476 views

Strange sphere reflection effects in ray tracing project. I cannot figure out what causes them

I started a ray tracing project in Python. I made a CUDA kernel and a trace() function for each pixel. It works perfectly when calculating intersections, shadows and lambert shading, however when I ...

Speterius

71

asked Aug 10, 2019 at 15:35

1 vote

0 answers

59 views

Why cache working set per multiprocessor for texture memory in Nvidia has a variable size?

I saw it here https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications__technical-specifications-per-compute-capability , I don't know if it also happens ...

alvaro9650

11

asked Jun 17, 2019 at 10:46

4 votes

2 answers

1k views

Can gl_WorkGroupSize be set outside the compute shader?

I am migrating code from CUDA to OpenGL compute shaders. I could not find a way to set the local workgroup size outside the shader; is there a way to set gl_WorkGroupSize in the calling program like ...

Vectorizer

299

asked Sep 3, 2018 at 7:36

3 votes

1 answer

120 views

How to avoid slowdown with 25-30 students running simple GPU kernels on 4 GeForce GTX 650 Ti s?

So I'm teaching crash-course in CUDA that teaches students how to write good GPU code (CUDA 7.5 in this case). They kernels they will be running will do matrix multiply on 2048x2048 floating point ...

lil' wing

33

asked Jul 19, 2018 at 14:28

2 votes

1 answer

4k views

Can you do GPGPU on integrated graphics like Intel HD 630 on Core i3?

The Core i3 says it has "integrated graphics" with Intel 630. Does this mean it is like a mini GPU? In which case can use things like CUDA to speed up C sharp programs? Is there much software that ...

zooby

121

asked Mar 25, 2018 at 19:38

Stack Exchange Network

Questions tagged [cuda]

How is Shader Execution Reordering implemented under the hood?

Why is my ray tracer not accelerated by CUDA?

What performance to expect from an optimised pathtracer?

What is in a GPU driver?

On modern GPUs, what hardware do vertex/pixel shaders have access to that compute shaders don't?

Performance difference in OpenGL compute shader vs Vulkan Compute Shader vs Cuda/OpenCL and others

Why is the image from the CUDA version not as sharp as the CPU version?

Why does my image has a square pattern in cuda path tracer?

Groupshared memory and parallel reduction over multiple kernel dispatch

How to display image from CUDA / What API to port my project to for real-time raytracing?

Strange sphere reflection effects in ray tracing project. I cannot figure out what causes them

Why cache working set per multiprocessor for texture memory in Nvidia has a variable size?

Can gl_WorkGroupSize be set outside the compute shader?

How to avoid slowdown with 25-30 students running simple GPU kernels on 4 GeForce GTX 650 Ti s?

Can you do GPGPU on integrated graphics like Intel HD 630 on Core i3?

Hot Network Questions

Questions tagged [cuda]

Related Tags