OpenCL-CTS

mirror of https://github.com/KhronosGroup/OpenCL-CTS.git synced 2026-03-20 22:39:03 +00:00

Author	SHA1	Message	Date
Ben Ashbaugh	620c689919	update fp16 staging branch from main (#1903 ) * allocations: Move results array from stack to heap (#1857) * allocations: Fix stack overflow * check format fixes * Fix windows stack overflow. (#1839) * thread_dimensions: Avoid combinations of very small LWS and very large GWS (#1856) Modify the existing condition to include extremely small LWS like 1x1 on large GWS values * c11_atomics: Reduce the loopcounter for sequential consistency tests (#1853) Reduce the loop from 1000000 to 500000 since the former value makes the test run too long and cause system issues on certain platforms * Limit individual allocation size using the global memory size (#1835) Signed-off-by: Ahmed Hesham <ahmed.hesham@arm.com> * geometrics: fix Wsign-compare warnings (#1855) Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> * integer_ops: fix -Wformat warnings (#1860) The main sources of warnings were: * Printing of a `size_t` which requires the `%zu` specifier. * Printing of `cl_long`/`cl_ulong` which is now done using the `PRI64` macros to ensure portability across 32 and 64-bit builds. Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> Replace OBSOLETE_FORAMT with OBSOLETE_FORMAT (#1776) * Replace OBSOLETE_FORAMT with OBSOLETE_FORMAT In imageHelpers.cpp and few other places in image tests, OBSOLETE_FORMAT is misspelled as OBSOLETE_FORAMT. Fix misspelling by replcaing it with OBSOLETE_FORMAT. Fixes #1769 * Remove code guarded by OBSOLETE_FORMAT Remove code guarded by OBSOLETE_FORMAT as suggested by review comments Fixes #1769 * Fix formating issues for OBSOLETE_FORMAT changes Fix formatting issues observed in files while removing code guarded by OBSOLETE_FORMAT Fixes #1769 * Some more formatting fixes Some more formatting fixes to get CI clean Fixes #1769 * Final Formating fixes Final formatting fixes for #1769 * Enhancement: Thread dimensions user parameters (#1384) * Fix format in the test scope * Add user params to limit testing Add parameters to reduce amount of testing. Helpful for debugging or for machines with lower performance. * Restore default value * Print info only if testing params bigger than 0. * [NFC] conversions: reenable Wunused-but-set-variable (#1845) Remove an assigned-to but unused variable. Reenable the Wunused-but-set-variable warning for the conversions suite, as it now compiles cleanly with this warning enabled. Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> * Fix bug of conversion from long to double (#1847) * Fix bug of conversion from long to double It the input is long type, it should be load as long type, not ulong. * update long2float * math_brute_force: fix exp/exp2 rlx ULP calculation (#1848) Fix the ULP error calculation for the `exp` and `exp2` builtins in relaxed math mode for the full profile. Previously, the `ulps` value kept being added to while verifying the result buffer in a loop. `ulps` could even become a `NaN` when the input argument being tested was a `NaN`. Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> * Enable LARGEADDRESSAWARE for 32 bit compilation (#1858) * Enable LARGEADDRESSAWARE for 32 bit compilation 32-bit executables built with MSVC linker have only 2GB virtual memory address space by default, which might not be sufficient for some tests. Enable LARGEADDRESSAWARE linker flag for 32-bit targets to allow tests to handle addresses larger than 2 gigabytes. https://learn.microsoft.com/en-us/cpp/build/reference/largeaddressaware-handle-large-addresses?view=msvc-170 Signed-off-by: Guo, Yilong <yilong.guo@intel.com> * Apply suggestion Co-authored-by: Ben Ashbaugh <ben.ashbaugh@intel.com> --------- Signed-off-by: Guo, Yilong <yilong.guo@intel.com> Co-authored-by: Ben Ashbaugh <ben.ashbaugh@intel.com> * fix return code when readwrite image is not supported (#1873) This function (do_test) starts by testing write and read individually. Both of them can have errors. When readwrite image is not supported, the function returns TEST_SKIPPED_ITSELF potentially masking errors leading to the test returning EXIT_SUCCESS even with errors along the way. * fix macos builds by avoiding double compilation of function_list.cpp for test_spir (#1866) * modernize CMakeLists for test_spir * add the operating system release to the sccache key * include the math brute force function list vs. building it twice * fix the license header on the spirv-new tests (#1865) The source files for the spirv-new tests were using the older Khronos license instead of the proper Apache license. Fixed the license in all source files. * compiler: fix grammar in error message (#1877) Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> * Updated semaphore tests to use clSemaphoreReImportSyncFdKHR. (#1854) * Updated semaphore tests to use clSemaphoreReImportSyncFdKHR. Additionally updated common semaphore code to handle spec updates that restrict simultaneous importing/exporting of handles. * Fix build issues on CI * gcc build issues * Make clReImportSemaphoreSyncFdKHR a required API call if cl_khr_external_semaphore_sync_fd is present. * Implement signal and wait for all semaphore types. * subgroups: fix for testing too large WG sizes (#1620) It seemed to be a typo; the comment says that it tries to fetch local size for a subgroup count with above max WG size, but it just used the previous subgroup count. The test on purpose sets a SG count to be a larger number than the max work-items in the work group. Given the minimum SG size is 1 WI, it means that there can be a maximum of maximum work-group size of SGs (of 1 WI of size). Thus, if we request a number of SGs that exceeds the local size, the query should fail as expected. * add SPIR-V version testing (#1861) * basic SPIR-V 1.3 testing support * updated script to compile for more SPIR-V versions * switch to general SPIR-V versions test * update copyright text and fix license * improve output while test is running * check for higher SPIR-V versions first * fix formatting * fix the reported platform information for math brute force (#1884) When the math brute force test printed the platform version it always printed information for the first platform in the system, which could be different than the platform for the passed-in device. Fixed by querying the platform from the passed-in device instead. * api tests fix: Use MTdataHolder in test_get_image_info (#1871) * Minor fixes in mutable dispatch tests. (#1829) * Minor fixes in mutable dispatch tests. * Fix size of newWrapper in MutableDispatchSVMArguments. * Fix errnoneus clCommandNDRangeKernelKHR call. Signed-off-by: John Kesapides <john.kesapides@arm.com> * * Set the row_pitch for imageInfo in MutableDispatchImage1DArguments and MutableDispatchImage2DArguments. The row_pitch is used by get_image_size() to calculate the size of the host pointers by generate_random_image_data. Signed-off-by: John Kesapides <john.kesapides@arm.com> --------- Signed-off-by: John Kesapides <john.kesapides@arm.com> * add test for cl_khr_spirv_linkonce_odr (#1226) * initial version of the test with placeholders for linkonce_odr linkage * add OpExtension SPV_KHR_linkonce_odr extension * add check for extension * switch to actual LinkOnceODR linkage * fix formatting * add a test case to ensure a function with linkonce_odr is exported * add back the extension check * fix formatting * undo compiler optimization and actually add the call to function a * [NFC] subgroups: remove unnecessary extern keywords (#1892) In C and C++ all functions have external linkage by default. Also remove the unused `gMTdata` and `test_pipe_functions` declarations. Fixes https://github.com/KhronosGroup/OpenCL-CTS/issues/1137 Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> * Added cl_khr_fp16 extension support for test_decorate from spirv_new (#1770) * Added cl_khr_fp16 extension support for test_decorate from spirv_new, work in progres * Complemented test_decorate saturation test to support cl_khr_fp16 extension (issue #142) * Fixed clang format * scope of modifications: -changed naming convention of saturation .spvasm files related to test_decorate of spirv_new -restored float to char/uchar saturation tests -few minor corrections * fix ranges for half testing * fix formating * one more formatting fix * remove unused function * use isnan instead of std::isnan isnan is currently implemented as a macro, not as a function, so we can't use std::isnan. * fix Clang warning about inexact conversion --------- Co-authored-by: Ben Ashbaugh <ben.ashbaugh@intel.com> * add support for custom devices (#1891) enable the CTS to run on custom devices --------- Signed-off-by: Ahmed Hesham <ahmed.hesham@arm.com> Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> Signed-off-by: Guo, Yilong <yilong.guo@intel.com> Signed-off-by: John Kesapides <john.kesapides@arm.com> Co-authored-by: Sreelakshmi Haridas Maruthur <sharidas@quicinc.com> Co-authored-by: Haonan Yang <haonan.yang@intel.com> Co-authored-by: Ahmed Hesham <117350656+ahesham-arm@users.noreply.github.com> Co-authored-by: Sven van Haastregt <sven.vanhaastregt@arm.com> Co-authored-by: niranjanjoshi121 <43807392+niranjanjoshi121@users.noreply.github.com> Co-authored-by: Grzegorz Wawiorko <grzegorz.wawiorko@intel.com> Co-authored-by: Wenwan Xing <wenwan.xing@intel.com> Co-authored-by: Yilong Guo <yilong.guo@intel.com> Co-authored-by: Romaric Jodin <89833130+rjodinchr@users.noreply.github.com> Co-authored-by: joshqti <127994991+joshqti@users.noreply.github.com> Co-authored-by: Pekka Jääskeläinen <pekka.jaaskelainen@tuni.fi> Co-authored-by: imilenkovic00 <155085410+imilenkovic00@users.noreply.github.com> Co-authored-by: John Kesapides <46718829+JohnKesapidesARM@users.noreply.github.com> Co-authored-by: Marcin Hajder <marcin.hajder@gmail.com> Co-authored-by: Aharon Abramson <aharon.abramson@mobileye.com>	2024-03-02 16:48:45 -08:00
Sven van Haastregt	aa34b4b8c3	[NFC] Remove unused depth_lod variables (#1696 ) Remove the remaining unused variables in the test harness. Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com>	2023-04-25 09:16:10 -07:00
Sven van Haastregt	9e0369b307	[NFC] Remove unused printMe variable (#1674 ) Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com>	2023-03-16 06:23:06 +00:00
Ben Ashbaugh	e71a7bce68	Revert "Image streams optimization (#1616 )" (#1638 ) This reverts commit `b73c3149ad`.	2023-02-28 09:06:34 -08:00
Chip Davis	b73c3149ad	Image streams optimization (#1616 ) * Don't recalculate image parameters repeatedly in `test_read_image()` We've already done this in the loop. There's no need to recalculate those parameters over and over again in `sample_image_pixel()` and `read_image_pixel()`. This should save some work during the image streams test. This only affects the 3D tests for now, but my time profiles indicate this is where we spend the most time anyway. * Vectorize read_image_pixel_float() and sample_image_pixel_float() for SSE/AVX This shortens the image streams test time from 45 minutes without it to 37 minutes. Unfortunately, most of the time is now spent waiting for memory, particularly in the 3D tests, because the 3D image doesn't neatly fit in the cache, especially in the linear sampling case, where pixels from two 2D slices must be sampled. Software prefetching won't help; it only helps when execution time is dominated by operations, but this is dominated by memory access. Randomized offsets are likely a factor, because they throw off the hardware prefetcher. One possible further optimization is, in the linear sampling case, to load two sampled pixels at once. This is easy to do using AVX, which extends SSE with 256-bit vectors. Obviously, this only applies to x86 CPUs with SSE2. The greatest performance gains, however, are seen with SSE4.1. Most modern x86 CPus have SSE4. Work is needed to support other CPUs' vector units--ARM Advanced SIMD/NEON is probably the most important one. Another possibility is arranging the code so that the compiler's autovectorization will kick in and do what I did here manually.	2023-02-07 08:46:15 -08:00
Sven van Haastregt	f6a963a583	harness: Fix -Wformat warnings (#1527 ) The main sources of warnings were: * Printing of a `size_t` which requires the `%zu` specifier. * Printing of `cl_long`/`cl_ulong` which is now done using the `PRI*64` macros to ensure portability across 32 and 64-bit builds. Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com> Signed-off-by: Sven van Haastregt <sven.vanhaastregt@arm.com>	2022-10-13 10:02:40 +01:00
Sven van Haastregt	e3e1786761	Fix newline in sample_image_pixel_float_offset log (#1446 )	2022-07-01 15:38:42 +01:00
Romaric Jodin	35c21a8e06	imageHelpers: add CL_UNORM_SHORT_{555, 565} in get_max_absolute_error (#1406 ) * imageHelpers: add CL_UNORM_SHORT_{555, 565} in get_max_absolute_error Working on a device supporting CL_UNORM_SHORT_565 image data type, I noticed that the max absolute error authorized was not the right one for such image data type. Also because of normalization, there is always an absolute error authorized whatever the filtering of the sampler. Ref #1140 * put back if statement on filter_mode	2022-04-28 14:46:52 -07:00
Jim Lewis	03da14d6a9	Fix clang 10 build errors (#1387 ) * Fix clang 10 build errors Lossy casts due to inexact float representation of CL_INT_MAX * Fix clang format * Remove implicit-const-int-float-conversion flag	2022-04-19 09:57:15 -07:00
Ben Ashbaugh	02bf24d2b1	remove min max macros (#1310 ) * remove the MIN and MAX macros and use the std versions instead * fix formatting * fix Arm build * remove additional MIN and MAX macros from compat.h	2021-09-13 13:25:32 +01:00
Stuart Brady	0876ea10be	Ignore padding bits in clCopyImage/clFillImage testing (#1184 ) The CL_UNORM_SHORT_555 and CL_UNORM_INT_101010 formats contain padding bits which need to be ignored in clCopyImage and clFillImage testing. For clFillImage tests, padding was not ignored for the CL_UNORM_SHORT_555 format, and was ignored for CL_UNORM_INT_101010 by modifying actual and reference data. For clCopyImage tests, padding was not ignored, both for CL_UNORM_SHORT_555 and for CL_UNORM_INT_101010. Fix this by adding a new compare_scanlines() function, which is used for both of these formats, and does not modify the actual or reference data. Signed-off-by: Stuart Brady <stuart.brady@arm.com>	2021-05-24 16:59:03 +01:00
Sreelakshmi Haridas Maruthur	6c8045911a	gles: Fix compile warnings. (#1070 ) * gles: Fix compile warnings. For 32 and 64-bit Visual Studio and the Android Q NDK. * Fix formatting violations Co-authored-by: spauls <spauls@qti.qualcomm.com>	2021-05-18 18:10:24 +01:00
Stuart Brady	6f2cd12a0b	Deduplicate logging of pixel differences (#1175 ) clCopyImage and clFillImage contain near-duplicate code for logging of pixel difference errors. Move this into imageHelpers. Signed-off-by: Stuart Brady <stuart.brady@arm.com>	2021-03-09 09:09:52 +00:00
John Kesapides	c587b45a2b	Minor fixes for CL_ARGB channel order. (#1128 ) Signed-off-by: John Kesapides <john.kesapides@arm.com> Change-Id: I4f6bbce14535f6156365a5a46c4739d6a7257ab2	2021-01-29 14:15:16 +00:00
James Price	03a0989998	Use std::vector for format lists in images suite (#1105 ) * Use std::vector for format lists in images suite Avoids memory deallocation issues and generally simplifies the code. * Fixup formatting with git-clang-format	2021-01-14 13:27:59 +00:00
ellnor01	25d9ff5d6e	Using helper functions for clCreateKernel (#1064 ) * Using helper functions for clCreateKernel Uses of clCreateKernel following create program helper functions, have been incorporated into create_single_kernel_helper when suitable. Contributes #31 Signed-off-by: Ellen Norris-Thompson <ellen.norris-thompson@arm.com> * Skip tests using clCompileProgram in offline mode Contributes #31 Signed-off-by: Ellen Norris-Thompson <ellen.norris-thompson@arm.com> * Using type wrappers when using kernel helper functions Also includes fix for windows build Fixes #31 Signed-off-by: Ellen Norris-Thompson <ellen.norris-thompson@arm.com> * Remove clReleaseKernel for wrapped kernel Fixes #31 Signed-off-by: Ellen Norris-Thompson <ellen.norris-thompson@arm.com>	2021-01-07 11:34:42 +00:00
Stuart Brady	af7d914514	Reformat test harness code (#940 ) * Reformat common help text Signed-off-by: Stuart Brady <stuart.brady@arm.com> * Reformat test harness code This goes part of the way to fixing issue #625. Signed-off-by: Stuart Brady <stuart.brady@arm.com>	2020-10-30 14:13:52 +00:00
Chetan Mistry	7a735b74e3	Replace cl_ushort with cl_half (#885 ) (#1000 ) * test_common: Replace cl_ushort with cl_half (#885) Change-Id: I507eca2084629c3b6f3e7331f062f006edbce434 Signed-off-by: Chetankumar Mistry <chetan.mistry@arm.com> * buffers, pipes, profiling: Replace cl_ushort with cl_half (#885) Change-Id: Id9799322b636af6aa0eec3d4e846d7af8c7f9602 Signed-off-by: Chetankumar Mistry <chetan.mistry@arm.com> * images/kernel_read_write: Replace cl_ushort with cl_half (#885) Change-Id: I922ddb593b6e5631d0f4ea1522c7f75f8770be40 Signed-off-by: Chetankumar Mistry <chetan.mistry@arm.com> * half: Replace cl_ushort with cl_half (#885) Change-Id: I484a5bb2b33a7e87805fc6079953c66e5f8d9239 Signed-off-by: Chetankumar Mistry <chetan.mistry@arm.com>	2020-10-02 16:29:05 +01:00
Ben Ashbaugh	951d010eaf	add a new test to verify reported image formats (#963 )	2020-09-28 00:22:49 +01:00
Jesse Natalie	7fd87c704a	Use power-of-two alignment values for allocating pixel data (#827 )	2020-09-11 00:25:58 +01:00
Kévin Petit	ed50fcad2d	Use float<->half conversion routines from the OpenCL headers (#884 ) * Use float<->half conversion routines from the OpenCL headers Fixes #870 Signed-off-by: Kevin Petit <kevin.petit@arm.com> * Use cl_half_from_double * Fix windows build errors * Fix more build errors * Code formatting * Remove TEST class	2020-08-14 13:50:14 +01:00
James Price	944b0a8178	Enable -Werror for GCC/Clang builds (#786 ) * Enable -Werror for GCC/Clang builds Fixes many of the errors this produces, and disables a handful that didn't have solutions that were obvious (to me). * Check for `-W` flags empirically Remove cl_APPLE_fp64_basic_ops support * Undo NAN conversion fix * Add comments to warning override flags * Remove unneeded STRINGIFY definition * Fix tautological compare issue in basic * Use ABS_ERROR macro in image tests * Use fabs for ABS_ERROR macro * Move ABS_ERROR definition to common header	2020-05-27 19:13:11 +01:00
John Kesapides	687dc06254	Failure in copy_images max_images (#612 ) An existing workaround on the max_image size calulcation that disallows the width of an image to be less than 16 can stress the calculcation of the remainder dimension to be less than 1.0 in size. In three dimentional objects (3d,2d array) where one dimention is set to max and the other is set to 16 there might not be enough space left for the 3rd one. This workaround clamps the third dimension to a minimum of 1.0 Signed-off-by: John Kesapides <john.kesapides@arm.com>	2020-03-26 18:41:32 +00:00
Grzegorz Wawiorko	77da52a4f3	Fix image required format - sRGB (#678 )	2020-03-17 10:20:58 +00:00
Kévin Petit	4c5a8fff6d	Conditionally test BGRA in Basic readimage3d (#623 ) (#624 ) * imageHelpers: Created generic function that returns a vector of required image formats. An upcoming commit requires access to the vector of required image formats, separatley from check_minimum_supported. * imageHelpers: Added a new function is_image_format_required. This function can be used to determine for any given cl_image_format, whether the implementaion is required to support it. Conditionally test BGRA in Basic readimage3d (#623) This change adds checks to see if testing against an embedded implementation and if so, queries whether BGRA is supported or not. * Refactor based on PR review. * Update passed message code. * Changed scope of struct to be within test_readimage3d.	2020-03-05 18:47:51 +00:00
Jeremy Kemp	4c59bfa32f	Update table of required image formats (#427 ) (#629 ) * Update table of required image formats (#427) This commit updates the table of required image formats. The table is built depending on the profile of the device, the requested image type, and the avaiablitiy of relevent extensions. * Fixed incorrect argument to memcpy. * Made image format arrays static. * Utilised ARRAY_SIZE where appropriate. * Re-named required image format bools to be more explicit. * Made sRGBA, CL_UNORM_INT8 a required full format profile. Misinterpretation of the spec had made this optional. * check_minimum_supported: switched to using vectors. * Added CL_sRGB CL_UNORM_INT8 to full profile required formats. This matches the same channel data type requirement as CL_sRGBA. * Overload <= and >= for the Version class. * Correct the condition under which sRGB images are required. * Correct the required image formats are based on OpenCL version. The spec says that for different OpenCL versions, different sets of image formats are required. * Print out the correct OpenCL version when required image format is not found. * Improved the way in which image formats are added based on profile and version. * Potential build fix regarding isnan namespace issues. * Image Helpers: Remove duplicate copies when building required image format vectors. Also re-ordered a branch to make it clearer.	2020-03-03 21:39:32 +00:00
Kévin Petit	b93c1df933	Remove duplicate definition of align_{malloc,free} (#631 ) Also use it instead of duplicating the code. Fixes #326 Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2020-02-28 12:22:38 +00:00
jiabaxie	943ba04c0c	set gDeviceType in testharness.c (#597 ) * set gDeviceType in testharness.c, also moved gTestRounding to imageHelpers.cpp & .h and removed duplicate code from host_atomics.cpp * Cleaned up some redundant code * Reversed the change in testharness.c	2020-02-20 10:39:55 +00:00
jiabaxie	68d08e07bf	Moved all instances of gDeviceType to imageHelper.cpp (#575 ) * Moved all instances of gDeviceType to imageHelper.cpp * Missed one instance of gDeviceType * Removed all instances of extern cl_device_type gDeviceType, except in imageHelpers.h	2020-02-06 08:27:13 +01:00
Jim Lewis	040321d8b9	Allow CL_FLOAT denorm flushing for write tests (#28 ) (#456 ) * Require exact for match normals, instead of arbitrary .005 relative error * Add relaxation to allow 0 when float denormal is expected * Refactor to use common validation function	2019-11-15 13:38:15 +00:00
Pavan K Lanka	79d1a14aa0	Fix for https://github.com/KhronosGroup/OpenCL-CTS/issues/346 - causing special float number generation to be skipped (#443 )	2019-09-16 11:37:32 +01:00
Radek Szymanski	03650057bb	Move printing sub-test information into test harness (#421 ) This removes all the duplicated code from each test, and moves it to test harness so that we have single place where this information is printed. Signed-off-by: Radek Szymanski <radek.szymanski@arm.com>	2019-08-05 15:16:12 +01:00
Kévin Petit	6890b58c6f	cl21: Khronos Bug 16192: Make image helpers work on ARM hosts (#260 ) This fix is required due to char defaulting to unsigned on arm platforms while the test code assumes char will be signed. Signed-off-by: Sam Laynton <sam.laynton@arm.com> Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-03 01:16:17 +08:00
Kévin Petit	8d209840be	cl20: Khronos Bug 16236: Support CL_DEPTH images in the image helpers when using the border colour (#149 ) Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-04-10 13:25:13 +01:00
Kevin Petit	95b040bec2	Synchronise with Khronos-private Gitlab branch The maintenance of the conformance tests is moving to Github. This commit contains all the changes that have been done in Gitlab since the first public release of the conformance tests. Signed-off-by: Kevin Petit kevin.petit@arm.com	2019-03-05 16:24:50 +00:00
Kevin Petit	de6c3db41b	Add OSX to the list of Travis CI OSes Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2018-11-09 11:46:07 +00:00
Kedar Patil	2821bf1323	Initial open source release of OpenCL 2.2 CTS.	2017-05-16 18:44:33 +05:30

37 Commits