Program
The 13th International Symposium on Applied Reconfigurable Computing (ARC2017) is held from Monday the 3rd of April to Friday the 7th of April 2017.
On Monday and Friday, there will be tutorials. The main symposium will be held on Tuesday, Wednesday and Thursday.
The schedule below shows the different category sessions in their respective timeslots. You may click the links in the schedule to go to the respective description in the detailed program. Or click here to go to the detailed program directly.
ARC2017 Time Schedule:
April 3 Monday |
April 4 Tuesday |
April 5 Wednesday |
April 6 Thursday |
April 7 Friday |
|
---|---|---|---|---|---|
09:00 | Tutorial: PYNQ (Xilinx) |
Keynote 1: Onur Mutlu (ETH Zurich) |
Keynote 2: Walid Najjar (UC Riverside) |
Keynote 3: Cathal McCabe (Xilinx) |
Tutorial: ρ-VEX |
10:00 | Break | ||||
10:30 | Tutorial: PYNQ (cont.) (Xilinx) |
Session: Adaptive Architectures |
Session: Design Space Exploration |
Session: FPGA based design |
Tutorial: ρ-VEX (cont.) |
12:00 |
Lunch | ||||
14:00 | Tutorial: PYNQ (cont.) (Xilinx) |
Session: Embedded Computing and Security |
Session: Fault Tolerance |
Session: Neural Networks |
Tutorial: ρ-VEX (cont.) |
15:30 | Break | ||||
16:00 | Tutorial: PYNQ (cont.) (Xilinx) |
Session: Simulation and Synthesis |
Social Event (including dinner) |
Session: Languages and Estimation Techniques |
Tutorial: ρ-VEX (cont.) |
Keynotes
(and the Computing Platforms We Design Around It)
Abstract: The memory system is a fundamental performance and energy bottleneck in almost all computing systems. Recent system design, application, and technology trends that require more capacity, bandwidth, efficiency, and predictability out of the memory system make it an even more important system bottleneck. At the same time, DRAM and flash technologies are experiencing difficult technology scaling challenges that make the maintenance and enhancement of their capacity, energy efficiency, and reliability significantly more costly with conventional techniques. In fact, recent reliability issues with DRAM, such as the RowHammer problem, are already threatening system security and predictability.
In this talk, we first discuss major challenges facing modern memory systems in the presence of greatly increasing demand for data and its fast analysis. We then examine some promising research and design directions to overcome these challenges and thus enable scalable memory systems for the future. We discuss three key solution directions: 1) enabling new memory architectures, functions, interfaces, and better integration of memory and the rest of the system, 2) designing a memory system that intelligently employs emerging non-volatile memory (NVM) technologies and coordinates memory and storage management, 3) reducing memory interference and providing predictable performance to applications sharing the memory system. If time permits, we will also touch upon our ongoing related work in combating scaling challenges of NAND flash memory.
An accompanying paper, slightly outdated (circa 2015), can be found here.


The framework combines three main elements:
- the use of a high-level productivity language, Python in this case
- Python-callable hardware libraries based on FPGA overlays
- a web-based architecture incorporating the open-source Jupyter Notebook infrastructure served from Zynq's embedded processors

Cathal McCabe is a senior applications engineer in the Xilinx CTO (Chief Technology Officer) department. He based in the Xilinx European HQ in Dublin, Ireland, and manages the Xilinx University Program in EMEA.
Alongside his existing responsibilities, Cathal has been part of the development team in Xilinx working on hardware and software architectures for PYNQ.
Tutorials
PYNQ Workshop
Xilinx
Monday, April 3, 10:30
PYNQ is an open-source framework that enables programmers who want to use embedded systems to exploit the capabilities of Xilinx Zynq All Programmable SoCs (APSoC). It allows users to exploit custom hardware in the programmable logic without having to use ASIC-style CAD tools. Instead the APSoC is programmed in Python and the code is developed and tested directly on the embedded system. The programmable logic circuits are imported as hardware libraries and programmed through their APIs, in essentially the same way that software libraries are imported and programmed.
The framework combines four main elements: (1) the use of a high-level productivity language, Python in this case; (2) Python-callable hardware libraries based on FPGA overlays; (3) a web-based architecture incorporating the open-source Jupyter Notebook infrastructure served from Zynq's embedded processors; and (4) Jupyter Notebook's client-side, web apps. The result is a web-centric programming environment that enables software programmers to work at higher levels of design abstraction and to re-use both software and hardware libraries.
This tutorial will give a hands-on introduction to PYNQ framework. It will feature the latest version of PYNQ with Python 3.6 and Asyncio support for processor and fabric interrupts. Several new overlays will be introduced along with examples of overlay creation and binding into the PYNQ framework.
ρ-VEX Tutorial
Computer Engineering Laboratory, Delft University of Technology
Friday, April 7, 10:00
On the last day of ARC, a tutorial is organized to familiarize the participants with the ρ-VEX platform that is developed at Delft University of Technology. It is an open-source implementation of a design-time reconfigurable and run-time parameterizable VLIW processor. Design-time reconfigurability is realized by the highly generic VHDL code. It comes with a complete toolchain, simulator, debug & trace hardware and interfacing software.
The tutorial will highlight 2 use cases of the platform;
- The FPGA prototype of the dynamic core
- An FPGA overlay fabric consisting of 64 cores running on 200MHz targeting streaming image processing workloads
There will also be room for participants to port their application of interest to one (or both) the platforms to experiment with either the reconfigurable properties or the streaming fabric under guidance of the ρ-VEX developers. We have an industrial grade compiler, floating point emulation, math and C standard libraries and a simply Linux port, so we expect to be able to run most applications that are not too complex.
For more information about the platform, see http://rvex.ewi.tudelft.nl
A full release (4.1) is available on the site if you wish to do some experiments before the tutorial.
Preliminary Program:
- Intro
- Demos
- Release download & setup
- Lunch
- Hands-on running programs (compilation, simulation, synthesis & circuit simulation, run on board)
- Dynamic core
- Streaming platform
- Maybe some larger programs that need OS support (FreeRTOS/Linux)
- (configuration) Scheduling
- Running participant's code
PYNQ is an open-source framework that enables programmers who want to use embedded systems to exploit the capabilities of Xilinx Zynq All Programmable SoCs (APSoC). It allows users to exploit custom hardware in the programmable logic without having to use ASIC-style CAD tools. Instead the APSoC is programmed in Python and the code is developed and tested directly on the embedded system. The programmable logic circuits are imported as hardware libraries and programmed through their APIs, in essentially the same way that software libraries are imported and programmed.
The framework combines four main elements: (1) the use of a high-level productivity language, Python in this case; (2) Python-callable hardware libraries based on FPGA overlays; (3) a web-based architecture incorporating the open-source Jupyter Notebook infrastructure served from Zynq's embedded processors; and (4) Jupyter Notebook's client-side, web apps. The result is a web-centric programming environment that enables software programmers to work at higher levels of design abstraction and to re-use both software and hardware libraries.
This tutorial will give a hands-on introduction to PYNQ framework. It will feature the latest version of PYNQ with Python 3.6 and Asyncio support for processor and fabric interrupts. Several new overlays will be introduced along with examples of overlay creation and binding into the PYNQ framework.
On the last day of ARC, a tutorial is organized to familiarize the participants with the ρ-VEX platform that is developed at Delft University of Technology. It is an open-source implementation of a design-time reconfigurable and run-time parameterizable VLIW processor. Design-time reconfigurability is realized by the highly generic VHDL code. It comes with a complete toolchain, simulator, debug & trace hardware and interfacing software.
The tutorial will highlight 2 use cases of the platform; - The FPGA prototype of the dynamic core - An FPGA overlay fabric consisting of 64 cores running on 200MHz targeting streaming image processing workloads
There will also be room for participants to port their application of interest to one (or both) the platforms to experiment with either the reconfigurable properties or the streaming fabric under guidance of the ρ-VEX developers. We have an industrial grade compiler, floating point emulation, math and C standard libraries and a simply Linux port, so we expect to be able to run most applications that are not too complex.
For more information about the platform, see http://rvex.ewi.tudelft.nl
A full release (4.1) is available on the site if you wish to do some experiments before the tutorial.
Preliminary Program:
- Intro
- Demos
- Release download & setup
- Lunch
- Hands-on running programs (compilation, simulation, synthesis & circuit simulation, run on board)
- Dynamic core
- Streaming platform
- Maybe some larger programs that need OS support (FreeRTOS/Linux)
- (configuration) Scheduling
- Running participant's code
Detailed Program
TUESDAY (April 04, 2017)
09:00 |
Speaker: Onur Mutlu (ETH Zurich)
|
---|---|
BREAK | |
10:30 |
Session 1 - Adaptive Architectures |
Improving the Performance of Adaptive Cache in Reconfigurable VLIW Processor
| |
LP-P2IP: A Low-power Version of P2IP Architecture using Partial Reconfiguration (FP)
| |
NIM: An HMC-based Machine for Neuron Computation (SP)
| |
VLIW-based FPGA Computation Fabric for Medical Imaging (SP)
| |
LUNCH | |
14:00 |
Session 2 - Embedded Computing and Security |
Hardware Sandboxing: A Novel Defense Paradigm Against Hardware Trojans in Systems on Chip (FP)
| |
Rapid Development of Gzip with MaxJ (FP)
| |
On the Use of (Non-)Cryptographic Hashes on FPGAs (SP)
| |
An FPGA-based Implementation of a Pipelined FFT Processor for High-Speed Signal Processing Applications (SP)
| |
BREAK | |
16:00 |
Session 3 - Simulation and Synthesis |
Soft timing closure for soft programmable logic cores: The ARGen approach (FP)
| |
FPGA Debugging with MATLAB using a Rule-based Inference System (FP)
| |
Hardness Analysis and Instrumentation of Verilog Gate Level Code for FPGA-based Designs (SP)
| |
A Framework for High Level Simulation and Optimization of Coarse-Grained Reconfigurable Architectures (SP)
|
WEDNESDAY (April 05, 2017)
09:00 |
Speaker: Walid Najjar (University of California Riverside, US)
|
---|---|
BREAK | |
10:30 |
Session 4 - Design Space Exploration |
Parameter Sensitivity in Virtual FPGA Architectures (FP)
| |
Custom Framework for Run-time Trading Strategies (FP)
| |
Exploring HLS Optimizations for Efficient Stereo Matching Hardware Implementation (SP)
| |
Architecture Reconfiguration as a Mechanism for Sustainable Performance of Embedded Systems in case of Variations in Available Power (SP)
| |
LUNCH | |
14:00 |
Session 5 - Fault Tolerance |
Exploring Performance and Soft Error Recovery in Dual-Core LockStep ARM A9 Processor Embedded into Xilinx Zynq-7000 APSoC (FP)
| |
Applying TMR in Hardware Accelerators Generated by High-Level Synthesis Design Flow for Mitigating Multiple Bit Upsets in SRAM-based FPGAs (FP)
| |
BREAK | |
16:00 |
SOCIAL EVENT |
THURSDAY (April 06, 2017)
09:00 |
Speaker: Cathal McCabe (Xilinx)
|
---|---|
BREAK | |
10:30 |
Session 6 - FPGA Based Designs |
FPGA Applications in Unmanned Aerial Vehicles - A Review (FP)
| |
Genomic Data Clustering on FPGAs for Compression (FP)
| |
A Quantitative Analysis of the Memory Architecture of FPGA-SoCs (FP)
| |
LUNCH | |
14:00 |
Best Paper Announcement - Award sponsored by Springer |
14:10 |
Session 7 - Neural Networks |
Optimizing CNN-based Object Detection Algorithms on Embedded FPGA Platforms (FP)
| |
An FPGA Realization of a Deep Convolutional Neural Network using a Threshold Neuron Pruning (FP)
| |
Accuracy Evaluation of Long Short Term Memory Network Based Language Model with Fixed-Point Arithmetic (SP)
| |
FPGA Implementation of a Short Read Mapping Accelerator (SP)
| |
BREAK | |
16:00 |
Session 8 - Languages and Estimation Techniques |
dfesnippets: An Open-Source Library for Data flow Acceleration on FPGAs (FP)
| |
A Machine Learning Methodology for Cache Recommendation (FP)
| |
ArPALib: A Big Number Arithmetic Library for Hardware and Software implementations. A Case Study for the Miller-Rabin Primality Test (SP)
|
