S00 Ray Gen Only
================================
`Corresponding sample code can be found here <https://github.com/gprt-org/GPRT/tree/master/samples/s00-rayGenOnly>`_.

Introduction
------------

Overview
^^^^^^^^

.. The General Purpose Raytracing Toolkit (GPRT) is a powerful and versatile open 
.. source raytracing library designed to enable developers to quickly and easily 
.. prototype ideas involving all things high-performance ray tracing. The API is 
.. written using a combination of C++ and HLSL to support a wide range of features, 
.. including ray-object intersection, high performance tree construction, parallel 
.. compute kernels, and more. GPRT is also optimized for NVIDIA, AMD and Intel Arc 
.. architectures (with support for CPU architectures coming soon), making it 
.. suitable for use in a variety of applications. With its comprehensive feature 
.. set and flexible architecture, we believe that GPRT is an ideal choice for 
.. developers looking to prototype their ray tracing ideas to create stunning 3D 
.. graphics and highly efficient simulation codes.

Welcome to the GPRT samples course! GPRT is an open source raytracing library 
that allows developers to quickly prototype ideas involving high-performance ray 
tracing. It is optimized for NVIDIA, AMD and Intel Arc architectures, and 
support for CPU architectures is coming soon! By following these examples, 
you'll be able to leverage GPRT's comprehensive feature set and create your own 
high performance ray tracing applications.

.. It offers a comprehensive feature set and is suitable 
.. for a variety of applications.

Objective of this Sample
^^^^^^^^^^^^^^^^^^^^^^^^

.. In this first example, we will use GPRT to create an image of a simple 
.. checkerboard pattern. Traditionally speaking, computer graphics libraries will 
.. generate a checkerboard image as a sort-of diagnostic test, to reveal any 
.. possible issues with a display's resolution, color accuracy, or contrast. But 
.. for our purposes, a checkerboard also just so happens to be a very easy pattern 
.. to generate, and allows us to get something up and running before diving into
.. more advanced concepts like acceleration structures, geometry, or ray tracing 
.. pipelines.

In this first example, we'll use GPRT to render a basic checkerboard pattern. 
Traditionally, checkerboards serve as a good diagnostic test to reveal any 
possible issues with a display; but for us, it serves as an easy pattern to generate 
and will enable you to get something running before moving onto more advanced \
concepts.

After running this example, you should see an image like this appear on your 
screen and be saved as s00-rayGenOnly.png in the same directory as the sample's 
executable:

.. image:: ../images/s00-rayGenOnly.png

Setting up a GPRT Sample
------------------------

File Structure
^^^^^^^^^^^^^^
All of the samples in this repo follow a very similar four-file structure. In the 
`s00-rayGenOnly <https://github.com/gprt-org/GPRT/tree/master/samples/s00-rayGenOnly>`_
directory, we have four files: CMakeLists.txt, sharedCode.h, deviceCode.hlsl, 
and hostCode.cpp. 

`CMakeLists.txt <https://github.com/gprt-org/GPRT/blob/master/samples/s00-rayGenOnly/CMakeLists.txt>`_ 
tells our CMake build system how to compile our code into an executable or library. 

`sharedCode.h <https://github.com/gprt-org/GPRT/blob/master/samples/s00-rayGenOnly/sharedCode.h>`_ 
defines common data structures shared between our ray tracing device and our host 
system. The code in this file needs to be carefully written to compile with both HLSL *and* C++. 

`deviceCode.hlsl <https://github.com/gprt-org/GPRT/blob/master/samples/s00-rayGenOnly/deviceCode.hlsl>`_ 
defines all programs we would like to run in parallel on our ray tracing 
device, written using the HLSL programming language. You might also see these 
programs called *shaders* or *kernels*. 

`hostCode.cpp <https://github.com/gprt-org/GPRT/blob/master/samples/s00-rayGenOnly/hostCode.cpp>`_ 
defines our main function on the CPU that we will use to orchestrate what 
kernels to run on the device and how. This might include what 
parameters we pass to these kernels, how many threads to launch, and so on.

Configuring the Build System
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
The first step in using GPRT is to configure how to compile our code together
into an executable. Our samples do this by using the *CMakeLists.txt* files. In 
a typical workflow, this CMake file would be configured as follows:

.. literalinclude:: ../../../samples/s00-rayGenOnly/CMakeLists.txt
   :language: cmake
   :lines: 23-38


The `embed_devicecode` macro sets up a compilation target for our 
*deviceCode.hlsl* into assembly we can run on our raytracing device. 
This assembly is then embedded into a .cpp file matching 
the ``OUTPUT_TARGET`` name. 

.. note::
   For another minimal example, `check out the CMakeLists.txt in this project
   <https://github.com/gprt-org/h5m-reader/blob/main/CMakeLists.txt>`_

Creating our Checkerboard Test Pattern
--------------------------------------
The second step is to write our GPU program.
To create our checkerboard test pattern, we'll write a small program that will 
run in parallel on our ray tracing device, where we'll use individual threads to 
generate our pixel colors. 

Shared Code
^^^^^^^^^^^
To begin, we'll define some structures that will be shared between our host system
and our ray tracing device, which we'll declare in our *sharedCode.h* file. 

.. literalinclude:: ../../../samples/s00-rayGenOnly/sharedCode.h
   :language: c++
   :lines: 23-29

In this struct, we'll store the two colors to use for our checkerboard pattern, 
as well as a frame buffer of pixels to store the results into.

Device Code
^^^^^^^^^^^

Next, we'll create a *Ray Generation Program* that will run in parallel on the 
device. Normally, we use these kernels to generate rays to trace into the world. 
However, in this example we'll just be creating a checkerboard background. 


.. literalinclude:: ../../../samples/s00-rayGenOnly/deviceCode.hlsl
   :language: hlsl
   :lines: 23-46

In the code above, we declare our ray generation program using the macro 
``GPRT_RAYGEN_PROGRAM``. 
This macro is defined in `gprt_device.h <https://github.com/gprt-org/GPRT/blob/master/gprt/gprt_device.h>`_,
and it's purpose is to allow us to write multiple kernels all in the same HLSL
document. 

.. This macro takes as input the *name* of our program, followed
.. by the type and name of this kernel's "shader record".
This macro takes in the name of the kernel and the type and name of its *shader 
record*. In GPRT, every kernel receives a shader record, which acts as a block 
of parameters that are made available to the kernel when it's executed on the device.

This raygen kernel runs the same code in parallel over many different threads. In
our case, we will run one thread per pixel. We read the thread ID using 
``DispatchRaysIndex``, as well as how many threads were launched using 
``DispatchRaysDimensions``. We use the thread ID to determine which checker type 
our pixel lies within, and use ``gprt::store`` to store our color into our 
framebuffer at the given location.

Also, just like on the CPU, we can use printf to print out helpful debug messages! 

Host Code
^^^^^^^^^

As our final step, all that's left is to write our host side code. 
We begin by requesting a window and creating a ``GPRTContext``: 

.. that we will use to show our checkerboard pattern. 
.. Then, we create our ``GPRTContext``, which under the hood initializes our underlying
.. ray tracing framework and selects the devices we'll run our kernel on.

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 58-66

After that, we create a ``GPRTModule``, which acts as a container that will hold 
our compiled device kernels.

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 68-72

Because we're using two different devices to run our code (normally a CPU and a GPU), we have 
two separate compilation steps. The device code is compiled first, then the binary from that 
compilation step is embedded into our host side code. This way, GPRT executables are "standalone",
and don't need to carry around compiled shader files. 

Creating our Raytracing Pipeline
""""""""""""""""""""""""""""""""
Next, we'll setup our ray tracing pipeline. We'll talk more about the ray tracing 
pipeline in the next example, but essentially, it's a collection of GPU programs 
that all operate together. 

In this example, our ray tracing pipeline is super simple: just a single ray generation 
program. 

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 73-78

.. note::
   To create a handle to our ray generation program, we need to pass the 
   name of the program--here it's "simpleRayGen"--as well as the shader record 
   type--which is our `struct RayGenData` that we previously declared in our 
   "sharedCode.h" file.

.. note::
   Many GPRT handles can be made in one of two ways: ``GPRTHandle`` and 
   ``GPRTHandleOf<T>``. The first is a more raw C-like API, while the latter 
   uses C++ templates to increase type safety and help users reduce bugs 
   that might occur from incorrect ``void*`` casting. 

Setting Parameters
""""""""""""""""""
Next, we can pass data back and forth between the ray tracing device in two ways: through
constant uniform values, and through buffers. Uniform values are like single values, 
like our two ``float3`` color values used by our checkerboard. Buffers on the 
other hand are allocated regions of memory which contain many values--for example,
an array of pixel color values. Buffers can be read from and written to by both 
the host and the device, while uniforms can only be written to by the host, and 
are read-only on the device.

To represent our checkerboard image, we'll use a buffer of ``uint32_t``, 
where the first 8 bits represent the blue channel, the next 8 bits represent the 
green channel, then red, and finally alpha. 

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 84-86

.. note::
   To create a buffer, we can use either ``gprtDeviceBufferCreate``, ``gprtHostBufferCreate``,
   or ``gprtSharedBufferCreate``. As their names suggest, buffers made by 
   ``gprtDeviceBufferCreate`` will reside on our ray tracing device, while buffers 
   made with ``gprtHostBufferCreate`` will reside on our host system memory. 
   Buffers made with ``gprtSharedBufferCreate`` reside in a memory space shared
   between the host and the device called "BAR", and is limited to 256MB on 
   systems without resizable BAR.

Next, we'll pass our uniform values through the shader record belonging to our 
ray generation program. For our example, these uniforms are our checkerboard 
colors, as well as the device address to our frame buffer. To pass these uniforms, 
we first fetch a handle to our shader record using ``gprtRayGetPointer``. Once 
all parameters are set, we upload the values of these parameters to our device 
by calling ``gprtBuildShaderBindingTable``. 

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 88-98

Launching our Program
"""""""""""""""""""""
Finally, we can launch our ray generation program to create our checkerboard image. 
To do this, we call ``gprtRayGenLaunch2D``, giving our ray generation program to use as 
well as the number of threads we would like to execute concurrently. Then, to present 
our framebuffer to the screen, we call ``gprtBufferPresent``. We do this in a loop until 
the window's  "X" button has been clicked, at which point we save the framebuffer to an 
image using ``gprtBufferSaveImage``.

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 103-117

Cleaning Up
"""""""""""
When our program completes, we need to destroy all the objects we created. The 
order that these objects are destroyed is important, because some objects depend
on others to function properly. In general, we destroy our objects in the reverse 
order that they were made. 

.. literalinclude:: ../../../samples/s00-rayGenOnly/hostCode.cpp
   :language: c++
   :lines: 123-127

Conclusion
----------
If you've followed along so far, congrats! Although this example is relatively simple, 
we've covered a lot of the key concepts you need to know in order to get up and running 
tracing rays. 

In the next example, we'll create a single triangle, and trace rays to intersect that triangle.
We'll be able to manipulate this triangle as well using some very simple camera math.