Andrew Robson, Simon Barnes Chief Sports Writer, James Christopher's, Raymond Snoddy Media Editor, David Adams, Robert Cole, Matthew Syed, Andrew Pierce, John Whitelegg Senior policy officer, Adam Sherwin Media Reporter, Frances Gibb Legal Editor, Magnus Linklater, Robert Thicknesse, Mark Souster, Andrew de Berry, Ian Blackshaw, Richard Hobson, Ivo Tennant, Tessa Jowell, Philip Howard, Oliver Kay and George Caulkin, David Chater, James Bone, Nic Hopkins, Barrie Behenna, Clive Davis, Alexandra Frean Social Affairs Correspondent, Mark Griffiths Horticulture Correspondent, Patrick Kidd, Gillian Harris Scotland Correspondent, Ivan Lawrence (MP), Ginny Dougary, Barbara Ellen, Brian E. Saunders, Kevin Eason, Neil Howlett, Joe Joseph, Ben Webster Transport Correspondent, Peter Wyman, Valerie Elliott Countryside Editor, Tina Brown, Steve Keenan, Thrasy Petropoulos, Stewart Tendler Crime Correspondent, Jeremy Westwood, Jennai Cox Fitness Editor, Anthony Browne, Dominic Walsh, Debra Craine, Norman Harris, Daniel Finkelstein, Charles Heyman, Tim Reid, Dan Sabbagh, Dalya Alberge Arts Correspondent, Robert Atkins, Ben Macintyre Parlimentary Sketch, Phoebe Greenwood, Lisa Murch, Alan Hamilton, Oliver Wright, Ian Cobain and Michael Evans, Jenny MacArthur, Demetrios Matheou, Anatole Kaletsky, Phil Gordon, Roland Watson, Richard Lloyd Parry, Ron Lewis, Peter D. Rossdale (Editor), Jonathan Porter, Matt Dickinson Chief Football Correspondent, Caroline Merrell Banking Correspondent, Angela Jameson, Tony Halpin Education Editor, Daniel McGrory, Oliver Kay, John Westerby, Tom Baldwin, Mark Henderson Science Correspondent, Sam Coates, Jill Sherman Whitehall Editor, Christopher Martin-Jenkins, John Hopkins Golf Correspondent, Rob Wright, Geoffrey Dean, Elaine Monaghan, Gary Duncan and Joe Bolger, Dan Sabbagh Telecoms Correspondent, Raymond Keene, Lewis Smith, Matthew Pryor, Richard Miles Investment Editor, Matt Dickinson, Sean MacAulay, Philip Whiteley, Peter Kimm, David Walton, Tom McIntyre, Gwen Staveley Teacher, George Caulkin and Peter Lansley, Benji Wilson, Chris Partridge, Beryl Dixon, Greg Hurst Parliamentary Correspondent, Nigel Hawkes, Peta Bee, Neville Scott, Chris Campling, Bronwen Maddox, Richard Owen, Tony Halpin and Glen Owen, Christopher Martin-Jenkins Chief Cricket Correspondent, Roger Boyes, David Mattin, Oliver August, Peter Dixon, Nick Hasell, Joanne Hart, Updesh Kapur, David Hands Rugby Correspondent, Glen Owen, Mike Mulvihill, John Goodbody, Philip Webster Political Editor, Melissa Kite Political Correspondent, Christopher Walker, Des Dearlove, Ian Johns, Mike Mason, John Crossland, Anjana Ahuja, Ian Cobain David Lister and Gabriel Rozenberg, Chris McGrath, Caitlin Moran, Ingrid Mansell, Alan Lee, Peter Riddell, Oliver Chastney, Nigel Massen, Tony Dawe, David Charter Chief Political Correspondent, Michael Dynes, Jack Malvern Arts Reporter, Gary Duncan Economics Correspondent, Martin Waller, Chris Ayres, Peter Inson (Headmaster), Mark Baldwin, Nigel Hawkes Health Editor, Sarah Butler, Benedict Nightingale, Dr Thomas Stuttaford, Stephen Dalton, Leo Lewis, Alyson Rudd, Frances Gibb, Richard Morrison, Abigail Rayner, Julian Ryall, Mark Thomas, Owen Slot, Robbie Millen, Jenny Davey, Colin McQuillan, Robin Shepherd, Patience Wheatcroft, David Alston, Lisa Verrico, David Lister Ireland Correspondent,
... CV's Home Based Business CMR OCPA Regulated NVIDIA Odgers HIGH growth Boxwood BSi Wragge&Co TAL ...
2003 - Gale Group | TDA

Vladimir Lončar, Luis E. Young-S., Srdjan Škrbić, Paulsamy Muruganandam, Sadhan K. Adhikari, Antun Balaž,
... as that a computer or a cluster has Nvidia GPU with Compute Capability 2.0 or higher, ... used for OpenMP/MPI version, and all available Nvidia GPUs across all cluster nodes used for CUDA/ ... typical modern computers. A parallel implementation exists, using Nvidia CUDA [2], and both versions are already used ... cores) with 32 GB of RAM and one Nvidia Tesla M2090 GPU with 6 GB of RAM, ... of DBEC-GP-MPI-CUDA programs compiled with Nvidia's nvcc compiler, with CUDA-aware OpenMPI implementation ... org/fftw3_doc/Real_002ddata-DFTs.html (2014); Nvidia's cuFFT accuracy and performance, http://docs.nvidia. ...
Tópico(s): Cold Atom Physics and Bose-Einstein Condensates
2016 - Elsevier BV | Computer Physics Communications
Mubeen Ghafoor, Shahzaib Iqbal, Syed Ali Tariq, Imtiaz Ahmad Taj, Noman M. Jafri,
... The GPU technology has been revolutionised, especially when NVIDIA [23-25] introduced 'compute unified device architecture' (CUDA) ... optimised implementation of MCC-based fingerprint matcher on NVIDIA's Tesla and GeForce GPUs and claimed a ... Section 2. Section 3 discusses the GPU and NVIDIA CUDA architecture. Section 4 discusses the proposed implementation ... brief overview of the GPU architecture and introduces NVIDIA CUDA programming architecture. 3 GPU and NVIDIA CUDA architecture To transform or map CPU algorithm ...
Tópico(s): Forensic Fingerprint Detection Methods
2017 - Institution of Engineering and Technology | IET Image Processing
Toru TAKAHASHI, Cris Cecka, William Fong, Eric Darve,
... to run the fast multipole method (FMM) on NVIDIA CUDA‐capable graphical processing units (GPUs) (Nvidia Corporation, Sta. Clara, CA, USA). The FMM is ... achieved performance over 200 Gflop/s on one NVIDIA Tesla C1060 GPU (Nvidia Corporation, Sta. Clara, CA, USA). This was compared ... cache misses. We also present benchmarks on an NVIDIA C2050 GPU (a Fermi processor)(Nvidia Corporation, Sta. Clara, CA, USA) in single and ...
Tópico(s): Antenna Design and Analysis
2011 - Wiley | International Journal for Numerical Methods in Engineering
Chunpeng Wu, Hsin‐Pai Cheng, Sicheng Li, Hai Li, Yiran Chen,
... in this section. Our GPU device is the NVIDIA TITAN X and cuDNN v5 [17] is adopted. ... our method can be further speed up using NVIDIA's techniques such as dynamic parallelism and hyper- ... blue) on Cityscapes. The GPU device used is NVIDIA GTX 1080. There are two ApesBlock and one ... JMLR), 2015, 37, pp. 1- 9 17https://developer.nvidia.com/cudnn 18Simonyan, K., Zisserman, A.: ' Very deep ... performance on ImageNet classification', arxiv, 2015 30https://www.nvidia.com/content/PDF/kepler/NVIDIA-Kepler-GK110-Architecture- ...
Tópico(s): Autonomous Vehicle Technology and Safety
2016 - Institution of Engineering and Technology | IET Cyber-Physical Systems Theory & Applications
Massimiliano Fasi, Nicholas J. Higham, Mantas Mikaitis, Srikara Pranesh,
... explore the floating-point arithmetic implemented in the NVIDIA tensor cores, which are hardware accelerators for mixed- ... are normalized. These aspects are not documented by NVIDIA, and we gain insight by running carefully designed ... important if one wishes to: (1) accurately simulate NVIDIA tensor cores on conventional hardware; (2) understand the ... build custom hardware whose behavior matches that of NVIDIA tensor cores. As part of this work we ... easily adapted to test newer versions of the NVIDIA tensor cores as well as similar accelerators from ...
Tópico(s): Low-power high-performance VLSI design
2021 - PeerJ, Inc. | PeerJ Computer Science
Wenli Zhang, Yuxin Liu, Kaizhen Chen, Huibin Li, Yulin Duan, Wenbin Wu, Yun Shi, Wei Guo,
... proposed algorithm was tested on three edge devices: NVIDIA Jetson Xavier NX, NVIDIA Jetson TX2, and NVIDIA Jetson NANO. The experimental results show that the ... respectively. Deploying the algorithm, the detection speed of NVIDIA Jetson Xavier NX reaches 21.3, 24.8, and 22.2 FPS, while that of NVIDIA Jetson TX2 reaches 13.9, 14.1, and 14.5 FPS and that of NVIDIA Jetson NANO reaches 6.3, 5.0, and ...
Tópico(s): Remote Sensing in Agriculture
2021 - Frontiers Media | Frontiers in Plant Science
Paul Martinsen, Johannes Blaschke, Rainer Künnemeyer, R. B. Jordan,
... in turbid media has been implemented on an NVIDIA® 8800gt graphics card using the CUDA toolkit. The ... Designed for Intel PCs. Phoogle-G requires a NVIDIA graphics card with support for CUDA 1.1 ... PC and a consumer grade graphics card from NVIDIA. Restrictions: The graphics card implementation uses single precision ... optical properties of the medium. References: http://www.nvidia.com/object/cuda_home.html. S. Prahl, M. ...
Tópico(s): Digital Radiography and Breast Imaging
2009 - Elsevier BV | Computer Physics Communications
Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama,
... SAT) problem on a state-of-the-art nVidia Tesla GPU using nVidia Compute Unified Device Architecture (CUDA). MAX-SAT is ... used for an efficient implementation of GAs over nVidia GPUs. We also design and introduce new techniques/ ... GAs and LS over such architectures. We use nVidia Tesla C1060 to perform several numerical tests and ...
Tópico(s): Algorithms and Data Compression
2009 - Springer Science+Business Media | Genetic Programming and Evolvable Machines
M J Harvey, Gianni De Fabritiis,
... programming model supported exclusively by GPUs manufactured by NVIDIA. An industry standardisation effort has recently produced the ... shown to have platform independence, running on both NVIDIA and AMD GPUs without modification. We conclude that ... RAM: 256 Mbytes Classification: 6.5 External routines: NVIDIA CUDA, OpenCL Nature of problem: Graphical Processing Units (GPUs) from NVIDIA are preferentially programed with the proprietary CUDA programming ... to CUDA and is also supported on non-NVIDIA hardware (including multicore ×86 CPUs, AMD GPUs and ...
Tópico(s): Software Testing and Debugging Techniques
2011 - Elsevier BV | Computer Physics Communications
Ziwen Dou, Dong Ye, Boya Wang,
... further boost accuracy. Finally, we accelerated AutoSegEdge using NVIDIA's TensorRT and deployed it on the Nvidia Jetson NX. Experiments demonstrate that multi-objectives NAS ... to obtain the best result on a single Nvidia Tesla V100 GPU. On the Cityscapes dataset, AutoSegEdge ... 70.3% with 16.6 FPS on the Nvidia Jetson NX (and 194.54 FPS on an Nvidia Tesla V100 GPU) at the original resolution (1024 × ...
Tópico(s): Domain Adaptation and Few-Shot Learning
2023 - Elsevier BV | Image and Vision Computing
Jinwoo Jeon, Sungwook Jung, Eungchang Mason Lee, Duckyu Choi, Hyun Myung,
... tests of various visual(-inertial) odometry algorithms on NVIDIA Jetson platforms. The compared algorithms include mono and ... and weight is limited. Jetson boards released by NVIDIA satisfy these constraints as they have a sufficiently ... study compares representative VO/VIO algorithms on several NVIDIA Jetson platforms, namely NVIDIA Jetson TX2, Xavier NX, and AGX Xavier, and ...
Tópico(s): Inertial Sensor and Navigation
2021 - Institute of Electrical and Electronics Engineers | IEEE Robotics and Automation Letters
... which is a parallel programming environment supported on NVIDIA GPUs, and emulated on less parallel CPUs. Given ... and theoretical. The authors are both affiliated with NVIDIA; Kirk is an NVIDIA Fellow and Hwu is principle investigator for the first NVIDIA CUDA Center of Excellence at the University of ...
Tópico(s): Cloud Computing and Resource Management
2010 - | Scalable Computing Practice and Experience
Mahantesh Halappanavar, John Feo, Oreste Villa, Antonino Tumeo, Alex Pothen,
... multicore (Intel Nehalem and AMD Magny-Cours), manycore (Nvidia Tesla and Nvidia Fermi), and massively multithreaded (Cray XMT) platforms. We ... cores of Intel Nehalem, [Formula: see text] on Nvidia Tesla and [Formula: see text] on Nvidia Fermi relative to one core of Intel Nehalem, ...
Tópico(s): Caching and Content Delivery
2012 - SAGE Publishing | The International Journal of High Performance Computing Applications
N. Govender, Daniël N. Wilke, Schalk Kok,
... and heuristics that are optimized for the parallel NVIDIA Kepler GPU architecture in detail. This includes a ... addition, we present heuristics optimized for the parallel NVIDIA Kepler GPU architecture. Our algorithms have minimalistic memory ... by simulating 34 million polyhedra on a single NVIDIA K6000 GPU. We show that by using the ...
Tópico(s): Mineral Processing and Grinding
2014 - Elsevier BV | Applied Mathematics and Computation
Wanglong Yan, Xiaohua Shi, Xin Yan, Lina Wang,
... OpenCV SURF v2.4.5 CUDA implementation on NVidia's GTX660 and GTX460SE GPUs, repectively. Our OpenCL ... sizes from 320*240 to 1024*768 on NVidia's GTX660 GPU, NVidia's GTX460SE GPU and AMD's Radeon HD 6850 GPU. Our OpenCL approach on NVidia's GTX660 GPU is more than 22.8 ...
Tópico(s): Image Processing Techniques and Applications
2013 - SAGE Publishing | International Journal of Advanced Robotic Systems
... units, namely, multicore CPU, Intel Xeon Phi, and NVidia GPU. The algorithm operates on dense tensors (multidimensional ... CPU and the use of shared memory on NVidia GPU. From the applied side, the ultimate goal ... x86 CPUs and 2–3 times speedup on NVidia Tesla K20X GPU with respect to the naïve ...
Tópico(s): Computational Physics and Python Applications
2015 - Elsevier BV | Computer Physics Communications
Hassan H. Halawa, Hazem A. Abdelhafez, Andrew Boktor, Matei Ripeanu,
This study characterizes the NVIDIA Jetson TK1 and TX1 Platforms, both built on a NVIDIA Tegra System on Chip and combining a quad-core ARM CPU and an NVIDIA GPU. Their heterogeneous nature, as well as their ...
Tópico(s): Distributed and Parallel Computing Systems
2017 - Springer Science+Business Media | Lecture notes in computer science
Pedro Valero‐Lara, Ivan Martínez-Pérez, Raúl Sirvent, Xavier Martorell, Antonio J. Peña,
... that multiple studies have explored the use of NVIDIA GPUs to accelerate such computation. However, these studies ... of systems. The gtsvStridedBatch routine in the cuSPARSE NVIDIA package is one of these examples, which is ... 6× (in single precision) faster using the latest NVIDIA GPU architecture, the Pascal P100.
Tópico(s): Cloud Computing and Resource Management
2018 - Wiley | Concurrency and Computation Practice and Experience
Craig Warren, Antonios Giannopoulos, Alan Gray, Iraklis Giannakis, A. Patterson, Laura Wetter, Andre Hamrah,
... We designed optimal kernels for GPU execution using NVIDIA’s CUDA framework. Our GPU solver achieved performance ... 1194 Mcells/s and 3405 Mcells/s on NVIDIA Kepler and Pascal architectures, respectively. This is up ... We found the cost–performance benefit of the NVIDIA GeForce-series Pascal-based GPUs – targeted towards the ... has been written in CUDA for execution on NVIDIA GPUs. This is in addition to the existing ...
Tópico(s): Geophysical and Geoelectrical Methods
2018 - Elsevier BV | Computer Physics Communications
Yuchen Ma, Jiajia Li, Xiaolong Wu, Chenggang Yan, Jimeng Sun, Richard Vuduc,
... sparse and semi-sparse tensors on CPU and NVIDIA GPU platforms. Ttm is a computational kernel in ... its conventional approach. We further optimize SpTtm on NVIDIA GPU platforms. Five approaches including employing fine thread ... library. GPU-SpTtm obtains 6–19× speedup on NVIDIA K40c and 23–67× speedup on NVIDIA P100 over CPU-SpTtm respectively. Our GPU-SpTtm ...
Tópico(s): Parallel Computing and Optimization Techniques
2018 - Elsevier BV | Journal of Parallel and Distributed Computing
Ilya V. Afanasyev, Vadim Voevodin, Vladimir Voevodin, Kazuhiko Komatsu, Hiroaki Kobayashi,
... computational characteristics of three high performance architectures: two NVIDIA GPU architectures (of Pascal and Volta generations) and ... despite having vectorised data-processing included in both NVIDIA GPU and NEC SX-Aurora TSUBASA architectures, vectorisation ... comparable and the others showed different efficiency on NVIDIA GPUs and NEC SX-Aurora TSUBASA vector processors, ...
Tópico(s): Interconnection Networks and Systems
2019 - Springer Science+Business Media | Lecture notes in computer science
Yu‐Hsiang Tsai, Terry Cojean, Hartwig Anzt,
... implementations on high-end GPUs from AMD and NVIDIA. Specifically, we optimize SpMV kernels for the CSR, ... our kernels against AMD's hipSPARSE library and NVIDIA's cuSPARSE library, and ultimately assess how the GPU technologies from AMD and NVIDIA compare in terms of SpMV performance.
Tópico(s): Stochastic Gradient Optimization Techniques
2020 - Springer Science+Business Media | Lecture notes in computer science
David Adam, Tianyu Liu, P Caracappa, Bryan Bednarz, Xie George Xu,
... which is capable of being executed on CPUs, NVIDIA GPUs, and AMD GPUs. This capability of fast ... RT to allow for device independent execution on NVIDIA and AMD GPUs. Architecture‐specific atomic‐add algorithms ... Timing studies were conducted on a CPU, an NVIDIA GPU, and an AMD GPU to evaluate the ... 187.9, and 216.8 s on an NVIDIA GPU, AMD GPU, and Intel CPU, respectively. Conclusion ...
Tópico(s): Medical Imaging Techniques and Applications
2020 - Wiley | Medical Physics
Kyle A. O’Connell, Zelaikha Yosufzai, Ross Campbell, Collin J. Lobb, Haley T. Engelken, Laura Gorrell, Thad Carlson, Josh J. Catana, Dina Mikdadi, Vivien Bonazzi, Juergen Klenk,
... we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, ... min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the ...
Tópico(s): Gene expression and cancer classification
2023 - BioMed Central | BMC Bioinformatics
Adam Krzywaniak, Paweł Czarnul, Jerzy Proficz,
... power is introduced on one of the latest NVIDIA GPUs: a software tool called the Dynamic Energy- ... sum (EDS). The tool gathers power measurements from NVIDIA Management Library (NVML). Measuring the application progress at ... We have evaluated the DEPO tool on the NVIDIA RTX A4500 and A100 GPUs with machine learning ... to obtain energy savings exceeding 22% for both NVIDIA A100 and RTX A4500 GPUs while the performance ...
Tópico(s): Green IT and Sustainability
2023 - Elsevier BV | Future Generation Computer Systems
Kai Cao, Qizhong Wu, Lingling Wang, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongqing Li, Lanning Wang,
... results show that running GPU-HADVPPM on one NVIDIA Tesla K40m and an NVIDIA Tesla V100 GPU can achieve up to a ... 1.3× and 18.8× speedup on an NVIDIA Tesla K40m GPU and NVIDIA Tesla V100 GPU, respectively. The multi-GPU acceleration ...
Tópico(s): Tropical and Extratropical Cyclones Research
2023 - Copernicus Publications | Geoscientific model development
Hamid Tabani, Fabio Mazzocchetti, Pedro Benedicte, Jaume Abella, Francisco J. Cazorla,
... with the aim of meeting these requirements, being NVIDIA Jetson TX2 and its high-performance successor, NVIDIA AGX Xavier, relevant representatives. However, to what extent ... on this question by modeling two recent automotive NVIDIA GPU-based platforms, namely TX2 and AGX Xavier. ...
Tópico(s): Embedded Systems Design Techniques
2021 - Elsevier BV | Journal of Parallel and Distributed Computing
... the performance of the concurrency mechanisms available on NVIDIA’s new Ampere GPU microarchitecture under deep learning ... thread block placement policies limits the effectiveness of NVIDIA’s concurrency mechanisms. In summary, the sequential nature ... and low, predictable turnaround times difficult on current NVIDIA hardware.
Tópico(s): Cloud Computing and Resource Management
2021 - Elsevier BV | Performance Evaluation
Jack Choquette, Wishwesh Gandhi, Olivier Giroux, Nick Stam, Ronny Krashinsky,
NVIDIA A100 Tensor Core GPU is NVIDIA's latest flagship GPU. It has been designed with many new innovative features to provide performance and capabilities for HPC, AI, and ... enhanced L2 cache, HBM2 DRAM, and third-generation NVIDIA NVLink I/O.
Tópico(s): Distributed and Parallel Computing Systems
2021 - Institute of Electrical and Electronics Engineers | IEEE Micro