INESC TEC
INESC TEC
INESC TEC
Search results for:
Filter your results

0 Search results

João Paiva Cardoso

João Paiva Cardoso

João M. P. Cardoso received his PhD degree in Electrical and Computer Engineering from the IST/UTL (Technical University of Lisbon), Lisbon, Portugal in 2001. He is currently Full Professor at the Department of Informatics Eng., Faculty of Eng. of the University of Porto, Porto, Portugal, and a research member of INESC TEC. Before, he was with the IST/UTL (2006-2008), a senior researcher at INESC-ID (2001-2009), and with the University of Algarve (1993-2006). In 2001/2002, he worked for PACT XPP Technologies, Inc., Munich, Germany. He has been involved in the organization and served as a Program Committee member for many international conferences. For example, he was general Co-Chair of IEEE/IFIP EUC’2015 and IEEE CSE’2015, General Chair of FPL’2013, General Co-Chair of ARC’2014 and ARC’2006, Program Co-Chair of ARCS’2016, DASIP’2014, and RAW’2010. He has (co-)authored over 150 scientific publications on subjects related to compilers, embedded systems, and reconfigurable computing. He has coordinated a number of research projects. He is a senior member of IEEE, a member of IEEE Computer Society, and a senior member of ACM.  His research interests include compilation techniques, domain-specific languages, reconfigurable computing, application-specific architectures, and high-performance computing with a particular emphasis in embedded computing.

Projects

PEPCC

The domains of embedded systems (ES) and high-performance computing (HPC) are usually seen as distant, but some of their requirements are converging: a modern ES runs complex algorithms with high computational power; the power consumption of ever larger HPC systems requires new levels of power efficiency ("green computing"). Both domains are gaining importance for the society as a whole and for the individual citizens,: complex embedded systems (including cyber-physical systems) have tangible impact on human well-being, natural resource management (smart homes, intelligent transportation) and industrial capabilities, while HPC systems are used to extract new knowledge from the mass of information acquired by ES and provide innovative services both to enterprises (including small and medium ones) and individuals (ranging from traffic management to computer-aided drug discovery). ES have adopted heterogeneous architectures, often based on reconfigurable accelerators like FPGAs or Coarse-Grained Reconfigurable Arrays (CGRAs), because they combine hardware specialization (improving performance and power efficiency) with adaptability at run-time. A similar trend towards heterogeneity is observed in HPC systems with GPUs as accelerators: HPC stakeholders have identified several challenges related to auto-tuning and self adapting systems, and power-aware resource management, matching the trends identified by ES stakeholders on the relevance of heterogeneous accelerators. Many applications in HPC and ES have a small number of regular computational kernels that account for most of the execution time and energy consumption. The manual introduction of reconfigurable accelerators requires significant design time and hardware expertise. It is vital to not compromise developer productivity by requiring manual hardware development and source code alterations. Therefore, this project focuses on approaches that only require knowledge about the program binary code and its runtime behavior. The goal of this project is to devise efficient techniques for dynamically mapping computations extracted from execution behavior to the resources of specialized reconfigurable accelerators. The techniques will identify at runtime the hotspots of program execution. They are then optimized and mapped to CGRAs tailored to the actual set of executing kernels. Whenever one hotspot needs to be executed, the accelerator is transparently invoked. The use of specialized CGRAs reduces resource usage and improves performance. The project will apply these concepts in the ES and HPC domains. Expected achievements include important performance improvements over CPU-based computation in the system (at least 3x on average), while maintaining almost the same energy consumption. The project expects to reduce energy consumption by at least 30% through reduction of the clock frequency of the CPU, while still matching the performance of the system without accelerator.

Power efficiency and performance for embedded and HPC systems with custom CGRAs

Publications

A Flexible-Granularity Task Graph Representation and Its Generation from C Applications (WIP)

Santos, T;Bispo, J;Cardoso, JMP;

2024

PROCEEDINGS OF THE 25TH ACM SIGPLAN/SIGBED INTERNATIONAL CONFERENCE ON LANGUAGES, COMPILERS, AND TOOLS FOR EMBEDDED SYSTEMS, LCTES 2024

A CPU-FPGA Holistic Source-To-Source Compilation Approach for Partitioning and Optimizing C/C plus plus Applications

Santos, T;Bispo, J;Cardoso, JMP;

2023

2023 32ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT

Preface ASAP 2023

Cardoso, JMP;Jimborean, A;Mentens, N;Coutinho, JGF;

2023

34th IEEE International Conference on Application-specific Systems, Architectures and Processors, ASAP 2023, Porto, Portugal, July 19-21, 2023

A DSL-based runtime adaptivity framework for Java

Carvalho, T;Bispo, J;Pinto, P;Cardoso, JMP;

2023

SOFTWAREX

View all publications

Supervised theses

ForestMP: Multimodal perception system for robotics in forestry applications

Daniel Queirós da Silva

D - 2023

UTAD-ECT