Buscar assunto

Portal de Periódicos da CAPES

Sobre
Acervo
Treinamentos
- Calendário
- Materiais de apoio
Informativos
Ajuda

Redes Sociais

Olá.

Escopo da Busca:

Filtros de busca

Tipo de Material

Limpar

Busca Avançada

6.368 resultados

Expandir meus resultados

Acesso aberto

Sim 3424

Não 2944

Tipo do recurso

Selecionar todos

Artigo

5309

Capítulo de livro

987

Carta

Revisão

Editorial

Paratexto

Conjunto de dados

Errata

Jornais

Livro

Outro

Ano de criação

Até

Produção nacional

Não 6255

Sim 113

Revisado por pares

Sim 5144

Não 1224

Áreas

Selecionar todos

Ciências Exatas e da Terra

4879

Multidisciplinar

2969

Engenharias

1628

Ciências da Saúde

879

Ciências Sociais Aplicadas

360

Ciências Biológicas

172

Ciências Humanas

150

Ciências Agrárias

Linguística, Letras e Artes

Idioma

Selecionar todos

Inglês

6291

Coreano

Português

Russo

Turco

Espanhol

Alemão

Francês

Indonésio

Japonês

Polonês

Catalão

Croata

Holandês

Lituano

Sueco

Editores

Selecionar todos

Elsevier BV

1001

Springer Science+Business Media

885

Institute of Electrical and Electronics Engineers

727

Multidisciplinary Digital Publishing Institute

402

Wiley

383

Association for Computing Machinery

244

SPIE

197

IOP Publishing

167

Springer Nature

164

SAGE Publishing

Springer International Publishing

Institution of Engineering and Technology

Taylor & Francis

Frontiers Media

EDP Sciences

American Chemical Society

American Institute of Physics

RELX Group (Netherlands)

Hindawi Publishing Corporation

Oxford University Press

World Scientific

Optica Publishing Group

BioMed Central

Trans Tech Publications

IEEE Computer Society

Public Library of Science

IGI Global

Inderscience Publishers

IOS Press

Institute of Electronics, Information and Communication Engineers

Mary Ann Liebert, Inc.

Copernicus Publications

Nature Portfolio

ACM SIGARCH

Lippincott Williams & Wilkins

Institute of Advanced Engineering and Science (IAES)

Institute of Physics

Acoustical Society of America

IEEE Sensors Council

Cambridge University Press

Radiological Society of North America

Science Press

Society for Industrial and Applied Mathematics

Emerald Publishing Limited

IEEE Magnetics Society

Science and Information Organization

De Gruyter

IEEE Antennas & Propagation Society

Scientific Research Publishing

AIP Publishing

Gale Group

Selecionar tudo

Filtrar

Exportar

BIBTEX RIS

Jornais Acesso aberto

1. News from 22/05/2003

Andrew Robson, Simon Barnes Chief Sports Writer, James Christopher's, Raymond Snoddy Media Editor, David Adams, Robert Cole, Matthew Syed, Andrew Pierce, John Whitelegg Senior policy officer, Adam Sherwin Media Reporter, Frances Gibb Legal Editor, Magnus Linklater, Robert Thicknesse, Mark Souster, Andrew de Berry, Ian Blackshaw, Richard Hobson, Ivo Tennant, Tessa Jowell, Philip Howard, Oliver Kay and George Caulkin, David Chater, James Bone, Nic Hopkins, Barrie Behenna, Clive Davis, Alexandra Frean Social Affairs Correspondent, Mark Griffiths Horticulture Correspondent, Patrick Kidd, Gillian Harris Scotland Correspondent, Ivan Lawrence (MP), Ginny Dougary, Barbara Ellen, Brian E. Saunders, Kevin Eason, Neil Howlett, Joe Joseph, Ben Webster Transport Correspondent, Peter Wyman, Valerie Elliott Countryside Editor, Tina Brown, Steve Keenan, Thrasy Petropoulos, Stewart Tendler Crime Correspondent, Jeremy Westwood, Jennai Cox Fitness Editor, Anthony Browne, Dominic Walsh, Debra Craine, Norman Harris, Daniel Finkelstein, Charles Heyman, Tim Reid, Dan Sabbagh, Dalya Alberge Arts Correspondent, Robert Atkins, Ben Macintyre Parlimentary Sketch, Phoebe Greenwood, Lisa Murch, Alan Hamilton, Oliver Wright, Ian Cobain and Michael Evans, Jenny MacArthur, Demetrios Matheou, Anatole Kaletsky, Phil Gordon, Roland Watson, Richard Lloyd Parry, Ron Lewis, Peter D. Rossdale (Editor), Jonathan Porter, Matt Dickinson Chief Football Correspondent, Caroline Merrell Banking Correspondent, Angela Jameson, Tony Halpin Education Editor, Daniel McGrory, Oliver Kay, John Westerby, Tom Baldwin, Mark Henderson Science Correspondent, Sam Coates, Jill Sherman Whitehall Editor, Christopher Martin-Jenkins, John Hopkins Golf Correspondent, Rob Wright, Geoffrey Dean, Elaine Monaghan, Gary Duncan and Joe Bolger, Dan Sabbagh Telecoms Correspondent, Raymond Keene, Lewis Smith, Matthew Pryor, Richard Miles Investment Editor, Matt Dickinson, Sean MacAulay, Philip Whiteley, Peter Kimm, David Walton, Tom McIntyre, Gwen Staveley Teacher, George Caulkin and Peter Lansley, Benji Wilson, Chris Partridge, Beryl Dixon, Greg Hurst Parliamentary Correspondent, Nigel Hawkes, Peta Bee, Neville Scott, Chris Campling, Bronwen Maddox, Richard Owen, Tony Halpin and Glen Owen, Christopher Martin-Jenkins Chief Cricket Correspondent, Roger Boyes, David Mattin, Oliver August, Peter Dixon, Nick Hasell, Joanne Hart, Updesh Kapur, David Hands Rugby Correspondent, Glen Owen, Mike Mulvihill, John Goodbody, Philip Webster Political Editor, Melissa Kite Political Correspondent, Christopher Walker, Des Dearlove, Ian Johns, Mike Mason, John Crossland, Anjana Ahuja, Ian Cobain David Lister and Gabriel Rozenberg, Chris McGrath, Caitlin Moran, Ingrid Mansell, Alan Lee, Peter Riddell, Oliver Chastney, Nigel Massen, Tony Dawe, David Charter Chief Political Correspondent, Michael Dynes, Jack Malvern Arts Reporter, Gary Duncan Economics Correspondent, Martin Waller, Chris Ayres, Peter Inson (Headmaster), Mark Baldwin, Nigel Hawkes Health Editor, Sarah Butler, Benedict Nightingale, Dr Thomas Stuttaford, Stephen Dalton, Leo Lewis, Alyson Rudd, Frances Gibb, Richard Morrison, Abigail Rayner, Julian Ryall, Mark Thomas, Owen Slot, Robbie Millen, Jenny Davey, Colin McQuillan, Robin Shepherd, Patience Wheatcroft, David Alston, Lisa Verrico, David Lister Ireland Correspondent,

... CV's Home Based Business CMR OCPA Regulated NVIDIA Odgers HIGH growth Boxwood BSi Wragge&Co TAL ...

2003 - Gale Group | TDA

Ver no editor

Artigo Acesso aberto

Produção Nacional Revisado por pares

2. OpenMP, OpenMP/MPI, and CUDA/MPI C programs for solving the time-dependent dipolar Gross–Pitaevskii equation

Vladimir Lončar, Luis E. Young-S., Srdjan Škrbić, Paulsamy Muruganandam, Sadhan K. Adhikari, Antun Balaž,

... as that a computer or a cluster has Nvidia GPU with Compute Capability 2.0 or higher, ... used for OpenMP/MPI version, and all available Nvidia GPUs across all cluster nodes used for CUDA/ ... typical modern computers. A parallel implementation exists, using Nvidia CUDA [2], and both versions are already used ... cores) with 32 GB of RAM and one Nvidia Tesla M2090 GPU with 6 GB of RAM, ... of DBEC-GP-MPI-CUDA programs compiled with Nvidia's nvcc compiler, with CUDA-aware OpenMPI implementation ... org/fftw3_doc/Real_002ddata-DFTs.html (2014); Nvidia's cuFFT accuracy and performance, http://docs.nvidia. ...

Tópico(s): Cold Atom Physics and Bose-Einstein Condensates

2016 - Elsevier BV | Computer Physics Communications

Ver no editor

Computer Physics Communications arXiv (Cornell University) Americanae (AECID Library) arXiv (Cornell University) DataCite API

Artigo Revisado por pares

3. Efficient fingerprint matching using GPU

Mubeen Ghafoor, Shahzaib Iqbal, Syed Ali Tariq, Imtiaz Ahmad Taj, Noman M. Jafri,

... The GPU technology has been revolutionised, especially when NVIDIA [23-25] introduced 'compute unified device architecture' (CUDA) ... optimised implementation of MCC-based fingerprint matcher on NVIDIA's Tesla and GeForce GPUs and claimed a ... Section 2. Section 3 discusses the GPU and NVIDIA CUDA architecture. Section 4 discusses the proposed implementation ... brief overview of the GPU architecture and introduces NVIDIA CUDA programming architecture. 3 GPU and NVIDIA CUDA architecture To transform or map CPU algorithm ...

Tópico(s): Forensic Fingerprint Detection Methods

2017 - Institution of Engineering and Technology | IET Image Processing

Ver no editor

IET Image Processing

Artigo Revisado por pares

4. Optimizing the multipole‐to‐local operator in the fast multipole method for graphical processing units

Toru TAKAHASHI, Cris Cecka, William Fong, Eric Darve,

... to run the fast multipole method (FMM) on NVIDIA CUDA‐capable graphical processing units (GPUs) (Nvidia Corporation, Sta. Clara, CA, USA). The FMM is ... achieved performance over 200 Gflop/s on one NVIDIA Tesla C1060 GPU (Nvidia Corporation, Sta. Clara, CA, USA). This was compared ... cache misses. We also present benchmarks on an NVIDIA C2050 GPU (a Fermi processor)(Nvidia Corporation, Sta. Clara, CA, USA) in single and ...

Tópico(s): Antenna Design and Analysis

2011 - Wiley | International Journal for Numerical Methods in Engineering

Ver no editor

International Journal for Numerical Methods in Engineering

Artigo Acesso aberto Revisado por pares

5. ApesNet: a pixel‐wise efficient segmentation network for embedded devices

Chunpeng Wu, Hsin‐Pai Cheng, Sicheng Li, Hai Li, Yiran Chen,

... in this section. Our GPU device is the NVIDIA TITAN X and cuDNN v5 [17] is adopted. ... our method can be further speed up using NVIDIA's techniques such as dynamic parallelism and hyper- ... blue) on Cityscapes. The GPU device used is NVIDIA GTX 1080. There are two ApesBlock and one ... JMLR), 2015, 37, pp. 1- 9 17https://developer.nvidia.com/cudnn 18Simonyan, K., Zisserman, A.: ' Very deep ... performance on ImageNet classification', arxiv, 2015 30https://www.nvidia.com/content/PDF/kepler/NVIDIA-Kepler-GK110-Architecture- ...

Tópico(s): Autonomous Vehicle Technology and Safety

2016 - Institution of Engineering and Technology | IET Cyber-Physical Systems Theory & Applications

Ver no editor

IET Cyber-Physical Systems Theory & Applications

Artigo Acesso aberto Revisado por pares

6. Numerical behavior of NVIDIA tensor cores

Massimiliano Fasi, Nicholas J. Higham, Mantas Mikaitis, Srikara Pranesh,

... explore the floating-point arithmetic implemented in the NVIDIA tensor cores, which are hardware accelerators for mixed- ... are normalized. These aspects are not documented by NVIDIA, and we gain insight by running carefully designed ... important if one wishes to: (1) accurately simulate NVIDIA tensor cores on conventional hardware; (2) understand the ... build custom hardware whose behavior matches that of NVIDIA tensor cores. As part of this work we ... easily adapted to test newer versions of the NVIDIA tensor cores as well as similar accelerators from ...

Tópico(s): Low-power high-performance VLSI design

2021 - PeerJ, Inc. | PeerJ Computer Science

Ver no editor

PeerJ Computer Science DOAJ (DOAJ: Directory of Open Access Journals) PubMed Central MIMS EPrints (University of Southampton) MIMS EPrints (University of Southampton) MIMS EPrints (University of Southampton) MIMS EPrints (University of Southampton)

Artigo Acesso aberto Revisado por pares

7. Lightweight Fruit-Detection Algorithm for Edge Computing Applications

Wenli Zhang, Yuxin Liu, Kaizhen Chen, Huibin Li, Yulin Duan, Wenbin Wu, Yun Shi, Wei Guo,

... proposed algorithm was tested on three edge devices: NVIDIA Jetson Xavier NX, NVIDIA Jetson TX2, and NVIDIA Jetson NANO. The experimental results show that the ... respectively. Deploying the algorithm, the detection speed of NVIDIA Jetson Xavier NX reaches 21.3, 24.8, and 22.2 FPS, while that of NVIDIA Jetson TX2 reaches 13.9, 14.1, and 14.5 FPS and that of NVIDIA Jetson NANO reaches 6.3, 5.0, and ...

Tópico(s): Remote Sensing in Agriculture

2021 - Frontiers Media | Frontiers in Plant Science

Ver no editor

Frontiers in Plant Science DOAJ (DOAJ: Directory of Open Access Journals) Europe PMC (PubMed Central) PubMed Central PubMed

Artigo Acesso aberto Revisado por pares

8. Accelerating Monte Carlo simulations with an NVIDIA® graphics processor

Paul Martinsen, Johannes Blaschke, Rainer Künnemeyer, R. B. Jordan,

... in turbid media has been implemented on an NVIDIA® 8800gt graphics card using the CUDA toolkit. The ... Designed for Intel PCs. Phoogle-G requires a NVIDIA graphics card with support for CUDA 1.1 ... PC and a consumer grade graphics card from NVIDIA. Restrictions: The graphics card implementation uses single precision ... optical properties of the medium. References: http://www.nvidia.com/object/cuda_home.html. S. Prahl, M. ...

Tópico(s): Digital Radiography and Breast Imaging

2009 - Elsevier BV | Computer Physics Communications

Ver no editor

Computer Physics Communications Research Commons (The University of Waikato)

Artigo Acesso aberto Revisado por pares

9. Hybrid of genetic algorithm and local search to solve MAX-SAT problem using nVidia CUDA framework

Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama,

... SAT) problem on a state-of-the-art nVidia Tesla GPU using nVidia Compute Unified Device Architecture (CUDA). MAX-SAT is ... used for an efficient implementation of GAs over nVidia GPUs. We also design and introduce new techniques/ ... GAs and LS over such architectures. We use nVidia Tesla C1060 to perform several numerical tests and ...

Tópico(s): Algorithms and Data Compression

2009 - Springer Science+Business Media | Genetic Programming and Evolvable Machines

Ver no editor

Genetic Programming and Evolvable Machines Hokkaido University Collection of Scholarly and Academic Papers (Hokkaido University)

Artigo Revisado por pares

10. Swan: A tool for porting CUDA programs to OpenCL

M J Harvey, Gianni De Fabritiis,

... programming model supported exclusively by GPUs manufactured by NVIDIA. An industry standardisation effort has recently produced the ... shown to have platform independence, running on both NVIDIA and AMD GPUs without modification. We conclude that ... RAM: 256 Mbytes Classification: 6.5 External routines: NVIDIA CUDA, OpenCL Nature of problem: Graphical Processing Units (GPUs) from NVIDIA are preferentially programed with the proprietary CUDA programming ... to CUDA and is also supported on non-NVIDIA hardware (including multicore ×86 CPUs, AMD GPUs and ...

Tópico(s): Software Testing and Debugging Techniques

2011 - Elsevier BV | Computer Physics Communications

Ver no editor

Computer Physics Communications

Artigo Revisado por pares

11. AutoSegEdge: Searching for the edge device real-time semantic segmentation based on multi-task learning

Ziwen Dou, Dong Ye, Boya Wang,

... further boost accuracy. Finally, we accelerated AutoSegEdge using NVIDIA's TensorRT and deployed it on the Nvidia Jetson NX. Experiments demonstrate that multi-objectives NAS ... to obtain the best result on a single Nvidia Tesla V100 GPU. On the Cityscapes dataset, AutoSegEdge ... 70.3% with 16.6 FPS on the Nvidia Jetson NX (and 194.54 FPS on an Nvidia Tesla V100 GPU) at the original resolution (1024 × ...

Tópico(s): Domain Adaptation and Few-Shot Learning

2023 - Elsevier BV | Image and Vision Computing

Ver no editor

Image and Vision Computing

Artigo Acesso aberto Revisado por pares

12. Run Your Visual-Inertial Odometry on NVIDIA Jetson: Benchmark Tests on a Micro Aerial Vehicle

Jinwoo Jeon, Sungwook Jung, Eungchang Mason Lee, Duckyu Choi, Hyun Myung,

... tests of various visual(-inertial) odometry algorithms on NVIDIA Jetson platforms. The compared algorithms include mono and ... and weight is limited. Jetson boards released by NVIDIA satisfy these constraints as they have a sufficiently ... study compares representative VO/VIO algorithms on several NVIDIA Jetson platforms, namely NVIDIA Jetson TX2, Xavier NX, and AGX Xavier, and ...

Tópico(s): Inertial Sensor and Navigation

2021 - Institute of Electrical and Electronics Engineers | IEEE Robotics and Automation Letters

Ver no editor

IEEE Robotics and Automation Letters arXiv (Cornell University)

Artigo Revisado por pares

13. Programming Massively Parallel Processors. A Hands-on Approach

Jie Cheng,

... which is a parallel programming environment supported on NVIDIA GPUs, and emulated on less parallel CPUs. Given ... and theoretical. The authors are both affiliated with NVIDIA; Kirk is an NVIDIA Fellow and Hwu is principle investigator for the first NVIDIA CUDA Center of Excellence at the University of ...

Tópico(s): Cloud Computing and Resource Management

2010 - | Scalable Computing Practice and Experience

Ver no editor

Scalable Computing Practice and Experience

Artigo Revisado por pares

14. Approximate weighted matching on emerging manycore and multithreaded architectures

Mahantesh Halappanavar, John Feo, Oreste Villa, Antonino Tumeo, Alex Pothen,

... multicore (Intel Nehalem and AMD Magny-Cours), manycore (Nvidia Tesla and Nvidia Fermi), and massively multithreaded (Cray XMT) platforms. We ... cores of Intel Nehalem, [Formula: see text] on Nvidia Tesla and [Formula: see text] on Nvidia Fermi relative to one core of Intel Nehalem, ...

Tópico(s): Caching and Content Delivery

2012 - SAGE Publishing | The International Journal of High Performance Computing Applications

Ver no editor

The International Journal of High Performance Computing Applications

Artigo Revisado por pares

15. Collision detection of convex polyhedra on the NVIDIA GPU architecture for the discrete element method

N. Govender, Daniël N. Wilke, Schalk Kok,

... and heuristics that are optimized for the parallel NVIDIA Kepler GPU architecture in detail. This includes a ... addition, we present heuristics optimized for the parallel NVIDIA Kepler GPU architecture. Our algorithms have minimalistic memory ... by simulating 34 million polyhedra on a single NVIDIA K6000 GPU. We show that by using the ...

Tópico(s): Mineral Processing and Grinding

2014 - Elsevier BV | Applied Mathematics and Computation

Ver no editor

Applied Mathematics and Computation

Artigo Acesso aberto Revisado por pares

16. Computing OpenSURF on OpenCL and General Purpose GPU

Wanglong Yan, Xiaohua Shi, Xin Yan, Lina Wang,

... OpenCV SURF v2.4.5 CUDA implementation on NVidia's GTX660 and GTX460SE GPUs, repectively. Our OpenCL ... sizes from 320*240 to 1024*768 on NVidia's GTX660 GPU, NVidia's GTX460SE GPU and AMD's Radeon HD 6850 GPU. Our OpenCL approach on NVidia's GTX660 GPU is more than 22.8 ...

Tópico(s): Image Processing Techniques and Applications

2013 - SAGE Publishing | International Journal of Advanced Robotic Systems

Ver no editor

International Journal of Advanced Robotic Systems DOAJ (DOAJ: Directory of Open Access Journals)

Artigo Acesso aberto Revisado por pares

17. An efficient tensor transpose algorithm for multicore CPU, Intel Xeon Phi, and NVidia Tesla GPU

Dmitry I. Lyakh,

... units, namely, multicore CPU, Intel Xeon Phi, and NVidia GPU. The algorithm operates on dense tensors (multidimensional ... CPU and the use of shared memory on NVidia GPU. From the applied side, the ultimate goal ... x86 CPUs and 2–3 times speedup on NVidia Tesla K20X GPU with respect to the naïve ...

Tópico(s): Computational Physics and Python Applications

2015 - Elsevier BV | Computer Physics Communications

Ver no editor

Computer Physics Communications OSTI OAI (U.S. Department of Energy Office of Scientific and Technical Information) OSTI OAI (U.S. Department of Energy Office of Scientific and Technical Information)

Capítulo de livro Revisado por pares

18. NVIDIA Jetson Platform Characterization

Hassan H. Halawa, Hazem A. Abdelhafez, Andrew Boktor, Matei Ripeanu,

This study characterizes the NVIDIA Jetson TK1 and TX1 Platforms, both built on a NVIDIA Tegra System on Chip and combining a quad-core ARM CPU and an NVIDIA GPU. Their heterogeneous nature, as well as their ...

Tópico(s): Distributed and Parallel Computing Systems

2017 - Springer Science+Business Media | Lecture notes in computer science

Ver no editor

Lecture notes in computer science

Artigo Acesso aberto Revisado por pares

19. cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs

Pedro Valero‐Lara, Ivan Martínez-Pérez, Raúl Sirvent, Xavier Martorell, Antonio J. Peña,

... that multiple studies have explored the use of NVIDIA GPUs to accelerate such computation. However, these studies ... of systems. The gtsvStridedBatch routine in the cuSPARSE NVIDIA package is one of these examples, which is ... 6× (in single precision) faster using the latest NVIDIA GPU architecture, the Pascal P100.

Tópico(s): Cloud Computing and Resource Management

2018 - Wiley | Concurrency and Computation Practice and Experience

Ver no editor

Concurrency and Computation Practice and Experience Zenodo (CERN European Organization for Nuclear Research) UPCommons institutional repository (Universitat Politècnica de Catalunya)

Artigo Acesso aberto Revisado por pares

20. A CUDA-based GPU engine for gprMax: Open source FDTD electromagnetic simulation software

Craig Warren, Antonios Giannopoulos, Alan Gray, Iraklis Giannakis, A. Patterson, Laura Wetter, Andre Hamrah,

... We designed optimal kernels for GPU execution using NVIDIA’s CUDA framework. Our GPU solver achieved performance ... 1194 Mcells/s and 3405 Mcells/s on NVIDIA Kepler and Pascal architectures, respectively. This is up ... We found the cost–performance benefit of the NVIDIA GeForce-series Pascal-based GPUs – targeted towards the ... has been written in CUDA for execution on NVIDIA GPUs. This is in addition to the existing ...

Tópico(s): Geophysical and Geoelectrical Methods

2018 - Elsevier BV | Computer Physics Communications

Ver no editor

Computer Physics Communications Aberdeen University Research Archive (Aberdeen University) Edinburgh Research Explorer (University of Edinburgh) Northumbria Research Link (Northumbria University) Edinburgh Research Explorer (University of Edinburgh) UWL Repository (University of West London) Northumbria Research Link (Northumbria University)

Artigo Acesso aberto Revisado por pares

21. Optimizing sparse tensor times matrix on GPUs

Yuchen Ma, Jiajia Li, Xiaolong Wu, Chenggang Yan, Jimeng Sun, Richard Vuduc,

... sparse and semi-sparse tensors on CPU and NVIDIA GPU platforms. Ttm is a computational kernel in ... its conventional approach. We further optimize SpTtm on NVIDIA GPU platforms. Five approaches including employing fine thread ... library. GPU-SpTtm obtains 6–19× speedup on NVIDIA K40c and 23–67× speedup on NVIDIA P100 over CPU-SpTtm respectively. Our GPU-SpTtm ...

Tópico(s): Parallel Computing and Optimization Techniques

2018 - Elsevier BV | Journal of Parallel and Distributed Computing

Ver no editor

Journal of Parallel and Distributed Computing

Capítulo de livro Revisado por pares

22. Analysis of Relationship Between SIMD-Processing Features Used in NVIDIA GPUs and NEC SX-Aurora TSUBASA Vector Processors

Ilya V. Afanasyev, Vadim Voevodin, Vladimir Voevodin, Kazuhiko Komatsu, Hiroaki Kobayashi,

... computational characteristics of three high performance architectures: two NVIDIA GPU architectures (of Pascal and Volta generations) and ... despite having vectorised data-processing included in both NVIDIA GPU and NEC SX-Aurora TSUBASA architectures, vectorisation ... comparable and the others showed different efficiency on NVIDIA GPUs and NEC SX-Aurora TSUBASA vector processors, ...

Tópico(s): Interconnection Networks and Systems

2019 - Springer Science+Business Media | Lecture notes in computer science

Ver no editor

Lecture notes in computer science

Capítulo de livro Acesso aberto Revisado por pares

23. Sparse Linear Algebra on AMD and NVIDIA GPUs – The Race Is On

Yu‐Hsiang Tsai, Terry Cojean, Hartwig Anzt,

... implementations on high-end GPUs from AMD and NVIDIA. Specifically, we optimize SpMV kernels for the CSR, ... our kernels against AMD's hipSPARSE library and NVIDIA's cuSPARSE library, and ultimately assess how the GPU technologies from AMD and NVIDIA compare in terms of SpMV performance.

Tópico(s): Stochastic Gradient Optimization Techniques

2020 - Springer Science+Business Media | Lecture notes in computer science

Ver no editor

Lecture notes in computer science

Artigo Revisado por pares

24. New capabilities of the Monte Carlo dose engine ARCHER‐RT: Clinical validation of the Varian TrueBeam machine for VMAT external beam radiotherapy

David Adam, Tianyu Liu, P Caracappa, Bryan Bednarz, Xie George Xu,

... which is capable of being executed on CPUs, NVIDIA GPUs, and AMD GPUs. This capability of fast ... RT to allow for device independent execution on NVIDIA and AMD GPUs. Architecture‐specific atomic‐add algorithms ... Timing studies were conducted on a CPU, an NVIDIA GPU, and an AMD GPU to evaluate the ... 187.9, and 216.8 s on an NVIDIA GPU, AMD GPU, and Intel CPU, respectively. Conclusion ...

Tópico(s): Medical Imaging Techniques and Applications

2020 - Wiley | Medical Physics

Ver no editor

Medical Physics

Artigo Acesso aberto Revisado por pares

25. Accelerating genomic workflows using NVIDIA Parabricks

Kyle A. O’Connell, Zelaikha Yosufzai, Ross Campbell, Collin J. Lobb, Haley T. Engelken, Laura Gorrell, Thad Carlson, Josh J. Catana, Dina Mikdadi, Vivien Bonazzi, Juergen Klenk,

... we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, ... min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the ...

Tópico(s): Gene expression and cancer classification

2023 - BioMed Central | BMC Bioinformatics

Ver no editor

BMC Bioinformatics PubMed Central PubMed

Artigo Revisado por pares

26. Dynamic GPU power capping with online performance tracing for energy efficient GPU computing using DEPO tool

Adam Krzywaniak, Paweł Czarnul, Jerzy Proficz,

... power is introduced on one of the latest NVIDIA GPUs: a software tool called the Dynamic Energy- ... sum (EDS). The tool gathers power measurements from NVIDIA Management Library (NVML). Measuring the application progress at ... We have evaluated the DEPO tool on the NVIDIA RTX A4500 and A100 GPUs with machine learning ... to obtain energy savings exceeding 22% for both NVIDIA A100 and RTX A4500 GPUs while the performance ...

Tópico(s): Green IT and Sustainability

2023 - Elsevier BV | Future Generation Computer Systems

Ver no editor

Future Generation Computer Systems

Artigo Acesso aberto Revisado por pares

27. GPU-HADVPPM V1.0: a high-efficiency parallel GPU design of the piecewise parabolic method (PPM) for horizontal advection in an air quality model (CAMx V6.10)

Kai Cao, Qizhong Wu, Lingling Wang, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongqing Li, Lanning Wang,

... results show that running GPU-HADVPPM on one NVIDIA Tesla K40m and an NVIDIA Tesla V100 GPU can achieve up to a ... 1.3× and 18.8× speedup on an NVIDIA Tesla K40m GPU and NVIDIA Tesla V100 GPU, respectively. The multi-GPU acceleration ...

Tópico(s): Tropical and Extratropical Cyclones Research

2023 - Copernicus Publications | Geoscientific model development

Ver no editor

Geoscientific model development

Artigo Acesso aberto Revisado por pares

28. Performance Analysis and Optimization Opportunities for NVIDIA Automotive GPUs

Hamid Tabani, Fabio Mazzocchetti, Pedro Benedicte, Jaume Abella, Francisco J. Cazorla,

... with the aim of meeting these requirements, being NVIDIA Jetson TX2 and its high-performance successor, NVIDIA AGX Xavier, relevant representatives. However, to what extent ... on this question by modeling two recent automotive NVIDIA GPU-based platforms, namely TX2 and AGX Xavier. ...

Tópico(s): Embedded Systems Design Techniques

2021 - Elsevier BV | Journal of Parallel and Distributed Computing

Ver no editor

Journal of Parallel and Distributed Computing arXiv (Cornell University) arXiv (Cornell University) DataCite API

Artigo Acesso aberto Revisado por pares

29. Characterizing concurrency mechanisms for NVIDIA GPUs under deep learning workloads

Guin Gilman, Robert J. Walls,

... the performance of the concurrency mechanisms available on NVIDIA’s new Ampere GPU microarchitecture under deep learning ... thread block placement policies limits the effectiveness of NVIDIA’s concurrency mechanisms. In summary, the sequential nature ... and low, predictable turnaround times difficult on current NVIDIA hardware.

Tópico(s): Cloud Computing and Resource Management

2021 - Elsevier BV | Performance Evaluation

Ver no editor

Performance Evaluation arXiv (Cornell University)

Artigo Revisado por pares

30. NVIDIA A100 Tensor Core GPU: Performance and Innovation

Jack Choquette, Wishwesh Gandhi, Olivier Giroux, Nick Stam, Ronny Krashinsky,

NVIDIA A100 Tensor Core GPU is NVIDIA's latest flagship GPU. It has been designed with many new innovative features to provide performance and capabilities for HPC, AI, and ... enhanced L2 cache, HBM2 DRAM, and third-generation NVIDIA NVLink I/O.

Tópico(s): Distributed and Parallel Computing Systems

2021 - Institute of Electrical and Electronics Engineers | IEEE Micro

Ver no editor

IEEE Micro

Exibir

1–30 de 6.368 itens

Página

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação