- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
I have very large symmetric NxN matrix (N~4-5x10^4) and need to compute y=A*x. I have been using dspmv from blas 2 for matrix vector multiplication since the packed matrix allows me enough storage. How do parallelize this for a multi-core machine while not increasing my memory footprint greatly? Memory requirements for such a problem size is right now 6 GB and I want to limit to 8 GB. So unpacking into a full matrix is not an option.
Link kopiert
1 Antworten
- Als neu kennzeichnen
- Lesezeichen
- Abonnieren
- Stummschalten
- RSS-Feed abonnieren
- Kennzeichnen
- Anstößigen Inhalt melden
Please allow me to ask some questions for clarification:
1) Are you doing multi-threading (shared-memory) parallelization?
2) Would you like to parallelize matrix-vector multiplication by doing multiple dspmv calls from different threads?
I cannot think of an easy way to achieve (2). Also, matrix-vector multiplication for your problem sizes may show poor scaling for single-socket systems.
Thank you,
Efe
Antworten
Themen-Optionen
- RSS-Feed abonnieren
- Thema als neu kennzeichnen
- Thema als gelesen kennzeichnen
- Diesen Thema für aktuellen Benutzer floaten
- Lesezeichen
- Abonnieren
- Drucker-Anzeigeseite