|
|
Pages: [1] 2 3 ... 17
|
 |
|
Author
|
Topic: AK V8 + CUDA MB team work mod (Read 26962 times)
|
|
Raistmer
|
This version intended to allow simultaneous SETI MB processing on CPU and GPU on the same host. It's just workaround of BOINC's inability to maintain such config. So, as any workaround it has its own advantages, disadvantages and limitations. Peruse known issues and use this package wisely. Please, report all unknows issues in this thread. Lets begin: 1) This first "proof of concept" version will work only on SSSE3-capable hosts (sorry AMD fans, if this will work SSE3 will be supported too of course). Only Intel Core and up for now.1a) Now SSSE3,SSE3 and SSE2 supported. AMD CPUs should use SSE3-variant with _AMD in name (if CPU supports SSE3 of course  otherwise - SSE2 ). 2) This package can work ONLY on SETI main. Even don't try to use it on SETI beta - you can lose tasks for nothing. 3) This CPU-GPU team will not play nicely with other GPU-related projects like GPU-grid. It's because no BOINC GPU-management mechanism is used in this version. This fact has positive side too - you do not need GPU-aware BOINC at all. You just need host with CUDA-supported GPU. It should work even with BOINC 5.xx 4) <ncpus>NUMBER_OF_CORES+1</ncpus> is REQUIRED for productive work. If you let BOINC manage CPU cores number you will end up with one idle core, trust me  5) This AK V8 build was not PGOed so it will show worse performance than current CPU-only AK V8 SSSE3x app (will be fixed if this approach will be useful)(fixed already) 6) Probably will not use second GPU on dual-GPU hosts. (confirmed) How it works: for BOINC it looks as usual CPU opt app installed. BOINC will call CPU app (AK_v8b_win_SSSE3x_GPU_CPU_team.exe in our case) ans assign one of SETI MB tasks for it. But this app aware of possibility to use GPU for computations. It will check if another instance (it knows only itself and its clones, so - no other GPU-related projects please) already use GPU and if not - will start GPU-related app (MB_6.08_mod_CPU_team_CUDA.exe in our case) and suspend itself until GPU app finish. This CUDA app will do all work as usually but will do it on GPU leave CPU almost free. That's why you should increase number of cores. BOINC should run NUMBER_OF_CORES+1 app thinking they all are CPU-related (some cheating of poor old BOINC here  ) After installation try to keep eye on first few results - this is pretty new approach and I can't give any guaranties if it will work for your config. If something will go wrong, please, revert to old variant you used before and describe your issue in this thread. P.S. Now you can easily see how fast CUDA is indeed (on non-VLAR tasks). CPU apps completed <20% of their tasks when CUDA app finished its first task on my Q9450+9600GSO host  Enjoy!
|
|
|
« Last Edit: 04 Feb 2009, 01:48:03 pm by Raistmer »
|
Logged
|
|
|
|
Richard Haselgrove
Alpha Tester
Knight who says 'Ni!'
 
Offline
Posts: 970
|
I was in the middle of flushing my Beta queue on the Q9300 anyway, since we have no new versions to test just now. I'll give this a run when I'm ready to switch, but it may be six hours or so till I get there - not quite sure how many of the remaining tasks are VLAR.
Have you any idea how compatible this package is with AP, if I transfer my r103 block into the app_info?
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
AP r103 is included as usually ! IT should work with AP too. But if you will have only AP tasks it will do NUMBER_OF_CORES+1 AP r103 and no CUDA MB. => GPU will be idle. (AP r103 provided "as is" w/o any modifications, it will use CPU just as will do any another app from another CPU-only project as Einstein for example. I expect some issues only with another GPU-enabled projects. And some inefficiencies because of BOINC inability to pair projects. This VERY needed feature still missing! We need option to pair one project with another.... I will post beta-site variant in pre-release area. This one will not work on beta.
|
|
|
|
« Last Edit: 25 Jan 2009, 07:07:20 am by Raistmer »
|
Logged
|
|
|
|
Grey Shadow
Guest
|
Hi again. Great mod, thanks a lot. The first one allowing me to crunch MB units with CUDA and at the same time use my PC for other tasks. However one bug still persists (it is present also in stock 6.08, according to my experience). Certain MB workunits start as usually (about 30 seconds with CPU, then switch to GPU), but are processed extremely slow (almost as AP ones). Also when they are processed PC starts working much slower so usually I have to cancel them manually. The sample of such workunit is http://setiathome.berkeley.edu/result.php?resultid=1134111853It looke like this is the same VLAR issue? Is it possible to do something to prevent this? Do you mind adding VLAR-killing ability to this mod?
|
|
|
|
« Last Edit: 25 Jan 2009, 01:39:39 pm by Grey Shadow »
|
Logged
|
|
|
|
|
Raistmer
|
Is it possible to do something to prevent this? Do you mind adding VLAR-killing ability to this mod?
Yes. This mod could be extended in two ways. 1) More advanced CPU/GPU choosing at app start. That is, if task is VLAR - stay with CPU, else - try pass task to GPU. 2) VLAR autokill mod.
|
|
|
|
|
Logged
|
|
|
|
Grey Shadow
Guest
|
Thanks, will be waiting for these modifications 
|
|
|
|
« Last Edit: 25 Jan 2009, 02:21:45 pm by Grey Shadow »
|
Logged
|
|
|
|
Richard Haselgrove
Alpha Tester
Knight who says 'Ni!'
 
Offline
Posts: 970
|
Finally cleared the last of my VLARs at Beta, installed this mod - and guess what, I got a 20-full of VLAR at Main! Like http://setiathome.berkeley.edu/result.php?resultid=1134023275 - AR=0.009331 This rig will do that sort of AR quicker on the CPU than the GPU, so I'd be interested in option (1) - hand over to CPU at VLAR - rather than autokill.
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
Finally cleared the last of my VLARs at Beta, installed this mod - and guess what, I got a 20-full of VLAR at Main! Like http://setiathome.berkeley.edu/result.php?resultid=1134023275 - AR=0.009331 This rig will do that sort of AR quicker on the CPU than the GPU, so I'd be interested in option (1) - hand over to CPU at VLAR - rather than autokill. It's really possible. Especially on quad! Algorithm: 1) Suspend all tasks. 2) resume non-VLAR task - it will be assigned to CUDA app. (I suppose SETI is single active project - you could enhance algorithm for more complex cases) 3) resume 4 (for quad) VLAR tasks - they will be assigned to CPU. Resume some non-VLARS for GPU. (GPU will crunch much faster so while CPU finish VLAR, GPU will finish few non-VLARs).
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
Thanks, will be waiting for these modifications  No sense to wait - use current version right now instead. Even with slow VLAR processing the use GPU is free addon to CPU now (of course if you don't participate in other GPU-related projects.)
|
|
|
|
|
Logged
|
|
|
|
Richard Haselgrove
Alpha Tester
Knight who says 'Ni!'
 
Offline
Posts: 970
|
Algorithm: 1) Suspend all tasks. 2) resume non-VLAR task - it will be assigned to CUDA app. (I suppose SETI is single active project - you could enhance algorithm for more complex cases) 3) resume 4 (for quad) VLAR tasks - they will be assigned to CPU. Resume some non-VLARS for GPU. (GPU will crunch much faster so while CPU finish VLAR, GPU will finish few non-VLARs).
Snag: 1) CPUs busy on 2 x AP, and 2 x Einstein - can't divert them to GPU 2) MB cache currently consists entirely of VLAR - all too similar to my Beta experience So like you say, I'll just treat the GPU as a free bonus, and let it carry on at the best speed it can manage. One possible drawback of your mod, and the way it bypasses BOINC's knowledge of CUDA: it doesn't do task-switching very well - I've already found it trying to run 5 x CPU and 0 x GPU a couple of times. Easy to manage with task suspension, but it still means babysitting.
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
One possible drawback of your mod, and the way it bypasses BOINC's knowledge of CUDA: it doesn't do task-switching very well - I've already found it trying to run 5 x CPU and 0 x GPU a couple of times. Easy to manage with task suspension, but it still means babysitting.
If only SETI MB used - all will be fine - always 4+1. If you will use AP too - things bacome complicated - yes, 5+0 possible (if no free SETI MB tasks) And if another project is used too, well, chances to get 5+0 increase. It could be solved by increasing SETI project share. Sure task scheduling is BOINC job, no science app job. And my mod just works instead BOINC now.
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
This is replacement for CUDA part of V8 of my package. This build has VLAR autokill ability. You need to decompress this file into SETI project directory (when BOINC switched off of course) if you don't want process VLAR on CUDA.
That is, if VLAR task goes to CPU app - it will be processed. If it goes to CUDA app - task will be aborted with error (BAD_HEADER).
EDIT: There was wrong file name (.exe twice). Archive replaced, please, remove secondary .exe by renaming manually or download new archive.
|
|
|
« Last Edit: 25 Jan 2009, 05:33:17 pm by Raistmer »
|
Logged
|
|
|
|
|
Raistmer
|
V8a update available. Changes:
- VLAR autokill mod enabled for CUDA app - PGO redone for CPU app - Wall-clock elapsed time since last restart (or since start of task) is added to stderr for both apps.
|
|
|
|
|
Logged
|
|
|
|
Slawek
Knight o' The Realm

Offline
Posts: 51
|
AMD SSE3 (X2 Athlon ) working on this build ?
|
|
|
|
|
Logged
|
|
|
|
|
Raistmer
|
AMD SSE3 (X2 Athlon ) working on this build ?
No, will do SSE3 soon. BTW, does anyone need SSE2 and low ? That is, does anybody have SSE2-only CPU with CUDA-enabled GPU ?
|
|
|
|
|
Logged
|
|
|
|
|
Pages: [1] 2 3 ... 17
|
|
|
|
Quote!
Two wrongs are only the beginning.- Murphy's Law
|
 |  |  |
| |
| Site Statistics |
| Total Members: | 123 |
| Total Posts: | 29,785 |
| Total Topics: | 892 | | Downloads |
| Apps |
| Windows R-1.x | 0 |
| Windows R-2.0 | 0 |
| Windows R-2.2 | 0 |
| Linux 32bit 1.x | 0 |
| Linux 32bit 2.2 | 0 |
| Linux 64bit 2.2 | 0 |
| Alpha/IA64 | 1,938 |
| FreeBSD | 0 |
| HPUX | 0 |
| Subtotal: | 0 |
| Source packs: | 5,803 |
| Tool/WU packs: | 10,078 |
| Total: | 22,040 | | GBs dl'd: | 309.53 | | Pages served |
| Today: | 6,653 |
| Total: | 8,668,257 |
| (since 6/26/2006) |
| 173 Donations to S@H |
| U.S. Dollars: | 3,196.59 |
| Euros: | 863.90 |
| Last 24h: | $ 0.00 |
| Avg./24h: | $ 3.32 |
| Estim. total: | $ 4,319.66 |
Latest Member: Miep |
| |
 | |  |
 |  |  |
| |
Online users/last 15m
33 Guests, 6 Users
Ghost, Tye, Raistmer, Claggy, Vyper, perryjay 43 Members/last 24hGhost, Tye, Raistmer, Claggy, Vyper, perryjay, Jason G, SciManStev, k6xt, Morten, arkayn, Slawek, cristipurdel, benool, Frizz, Purple Rabbit, sunu, Wild6-NJ, corsair, M_M, Franz, PatrickV2, JohnDK, _heinz, cenit, Josef W. Segur, glennaxl, skildude, msattler, mr.mac52, Geek@Play, Gizbar, Devaster, WHRoeder, kit344, Byron Leigh Hatch @ team Carl Sagan, TouchuvGrey, Metod, S56RKO, Questor, VoidPilot, The Grinch, hiamps, Pepi
| |
 | |  |
|