Community
cancel
Showing results for 
Search instead for 
Did you mean: 
140 Views

base image configured with JTAG can’t communicate over PCIe

We are having issues when writing a partial reconfig image to the FPGA over PCIe. It looks like the base image configured with JTAG can’t communicate over PCIe. The issue persists after a soft reboot following configuration. lspci shows the device.

after following"Initializing the Intel Arria 10 GX FPGA Development Kit for use with OpenC" instructions flashing "aocl flash acl0 boardtest.aocx"

"aocl diagnose" passes.

but then following the OpenVino Install linux with FPGA instructions

The outputs from the program step and a subsequent diagnose attempt are below.

aocl program acl0 1-0-1_A10DK_FP16_ResNet.aocx

aocl program: Running program from /opt/intel/intelFPGA_pro/17.0/hld/board/a10_ref/linux64/libexec

Programming device: a10gx : Arria 10 Reference Platform (acla10_ref0)

Reprogramming device [0] with handle 1

MMD INFO : [acla10_ref0] PR base and import compile IDs do not match

MMD INFO : [acla10_ref0] PR base ID currently configured is 0x2409880c

MMD INFO : [acla10_ref0] PR import compile expects ID to be 0x124020ee

MMD INFO : [acla10_ref0] Falling back to JTAG programming instead of PR

MMD INFO : JTAG Autodetect device found Cable:1 , Device Index:1

MMD INFO : setting Cable to autodetect value 1

MMD INFO : setting Device Index to autodetect value 1

MMD INFO : executing "quartus_pgm -c 1  -m jtag -o "P;reprogram_temp.sof@1""

Device: 10AX115S2F45

Warning (210120): Arria 10 information is incomplete. The ISP clamp functionality will be disabled.

Info: *******************************************************************

Info: Running Quartus Prime Programmer

    Info: Version 17.0.0 Build 290 04/26/2017 SJ Pro Edition

    Info: Copyright (C) 2017  Intel Corporation. All rights reserved.

    Info: Your use of Intel Corporation's design tools, logic functions

    Info: and other software and tools, and its AMPP partner logic

    Info: functions, and any output files from any of the foregoing

    Info: (including device programming or simulation files), and any

    Info: associated documentation or information are expressly subject

    Info: to the terms and conditions of the Intel Program License

    Info: Subscription Agreement, the Intel Quartus Prime License Agreement,

    Info: the Intel MegaCore Function License Agreement, or other

    Info: applicable license agreement, including, without limitation,

    Info: that your use is for the sole purpose of programming logic

    Info: devices manufactured by Intel and sold by Intel or its

    Info: authorized distributors.  Please refer to the applicable

    Info: agreement for further details.

    Info: Processing started: Mon Aug 13 13:58:29 2018

Info: Command: quartus_pgm -c 1 -m jtag -o P;reprogram_temp.sof@1

Info (213045): Using programming cable "USB-BlasterII [3-1]"

Info (213011): Using programming file reprogram_temp.sof with checksum 0x343B7836 for device 10AX115S2F45@1

Info (209060): Started Programmer operation at Mon Aug 13 13:58:37 2018

Info (209016): Configuring device index 1

Info (209017): Device 1 contains JTAG ID code 0x02E660DD

Info (209007): Configuration succeeded -- 1 device(s) configured

Info (209011): Successfully performed operation(s)

Info (209061): Ended Programmer operation at Mon Aug 13 13:58:53 2018

Info: Quartus Prime Programmer was successful. 0 errors, 1 warning

    Info: Peak virtual memory: 1749 megabytes

    Info: Processing ended: Mon Aug 13 13:58:53 2018

    Info: Elapsed time: 00:00:24

    Info: Total CPU time (on all processors): 00:00:13

MMD INFO : [acla10_ref0] PCIe-to-fabric read test failed, read 0xa0c7c1e6 after 1 attempts

mmd program_device:  Board reprogram failed

  OpenCL Notification Callback: Reprogram of device failed

Failed clCreateProgramWithBinary.

Error code: -2

aocl program: Program failed.

 

aocl diagnose

aocl diagnose: Running diagnose from /opt/intel/intelFPGA_pro/17.0/hld/board/a10_ref/linux64/libexec

aocl diagnose: failed 1 times. First error below:

Vendor: Intel(R) Corporation

MMD INFO : [acla10_ref0] PCIe-to-fabric read test failed, read 0xa0c7c1e6 after 1 attempts

Phys Dev Name  Status   Information

acla10_ref0   Failed   Board name not available.

                       Failed initial tests, so not working as expected.

                       Please try again after reprogramming the device.

Found no active device installed on the host machine.

Please make sure to:

      1. Set the environment variable AOCL_BOARD_PACKAGE_ROOT to the correct board package.

      2. Install the driver from the selected board package.

      3. Properly install the device in the host machine.

      4. Configure the device with a supported OpenCL design.

      5. Reboot the machine if the PCI Express link failed.

DIAGNOSTIC_FAILED

Any suggestions on what may be the issue, could it be the pci drivers on centos 7.4?

 

0 Kudos
2 Replies
Monique_J_Intel
Employee
140 Views

One thing you should try is:

lspci -vv 1172:2494

You should see "aclpci_a10_ref" and "aclpci_a10_ref_drv". If not, then it is a driver issue.

Let me know your results and we can proceed from there.

Kind Regards,

Monique Jones

140 Views

After lspci -vv

I see this

06:00.0 Processing accelerators: Altera Corporation Device 2494 (rev 01) (prog-if 01)
        Subsystem: Altera Corporation Device a151
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 16
        NUMA node: 0
        Region 0: Memory at f2140000 (64-bit, prefetchable) [size=512]
        Region 4: Memory at f2100000 (64-bit, prefetchable) [size=256K]
        Capabilities: [50] MSI: Enable- Count=1/4 Maskable- 64bit+
                Address: 0000000000000000  Data: 0000
        Capabilities: [78] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [80] Express (v2) Endpoint, MSI 00
                DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us
                        ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset- SlotPowerLimit 25.000W
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 128 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
                LnkCap: Port #1, Speed 8GT/s, Width x8, ASPM not supported, Exit Latency L0s <4us, L1 <1us
                        ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Not Supported, TimeoutDis+, LTR-, OBFF Not Supported
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
                LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
                         Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
                         Compliance De-emphasis: -6dB
                LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
                         EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
        Capabilities: [100 v1] Virtual Channel
                Caps:   LPEVC=0 RefClk=100ns PATEntryBits=1
                Arb:    Fixed- WRR32- WRR64- WRR128-
                Ctrl:   ArbSelect=Fixed
                Status: InProgress-
                VC0:    Caps:   PATOffset=00 MaxTimeSlots=1 RejSnoopTrans-
                        Arb:    Fixed- WRR32- WRR64- WRR128- TWRR128- WRR256-
                        Ctrl:   Enable+ ID=0 ArbSelect=Fixed TC/VC=01
                        Status: NegoPending- InProgress-
        Capabilities: [200 v1] Vendor Specific Information: ID=1172 Rev=0 Len=044 <?>
        Capabilities: [300 v1] #19
        Kernel driver in use: aclpci_a10_ref
        Kernel modules: aclpci_a10_ref_drv

Reply