ACCRE R9 Cluster Quick and Dirty Status
Report generated at Sun Dec 14 09:23:01 PM CST 2025
Problem Nodes
HOSTNAMES  STATE     AVAIL_FEATURES                  TIMESTAMP            USER     REASON
cn1370     drained*  intel_e5-2630_v3,haswell,intel  2025-12-12T11:26:08  root     Alex - RT96309 - per Eric, Decom
cn1376     draining  intel_e5-2630_v3,haswell,intel  2025-12-14T19:31:04  root     Kill task failed
cn1397     draining  intel_e5-2630_v3,haswell,intel  2025-12-14T07:36:40  root     Kill task failed
cn1586     drained   intel_5218,cascadelake,intel,x  2025-12-14T08:40:46  slurm    Prolog error
cn1630     drained   amd_9754,zen4,zen,amd,x86-64v4  2025-12-10T11:30:55  broadrt  Troy - n/a - using to test nic hgx01 because it was
gpu0045    inval     turing,intel_5118,skylake,inte  2025-12-11T10:50:21  slurm    Low RealMemory (reported:353823 < 100.00% of configu
gpu0057    drained*  turing,intel_4214r,cascadelake  2025-12-09T09:29:32  broadrt  Alex - RT91252 - Per Eric, Decom
gpu0065    drained   a6000,intel_platinum_8358,icel  2025-11-06T09:42:59  root     Sai - NA - HS Testing
gpu0066    drained   a6000,intel_platinum_8358,icel  2025-10-27T21:50:09  root     Sai - N/A - Hammerspace testing
hgx01      drained   h100_80gb,amd_9454,zen4,zen,am  2025-12-11T08:32:43  broadrt  Eric/Troy - RT95137 - NIC issues, troubleshooting
Queue Summary (Batch)
GROUP                       USER           ACTIVE_JOBS  ACTIVE_CORES  PENDING_JOBS  PENDING_CORES
-----------------------------------------------------------------------------------------
aldrich_lab                                          1             7             0              0
                            amannn1                  1             7             0              0
-----------------------------------------------------------------------------------------
behringer_lab                                        1            16             0              0
                            haleof                   1            16             0              0
-----------------------------------------------------------------------------------------
booth_lab                                            4            11             0              0
                            chenh55                  1             4             0              0
                            comptoab                 1             1             0              0
                            mathura                  1             4             0              0
                            zhut12                   1             2             0              0
-----------------------------------------------------------------------------------------
brg_cores                                            1            16             0              0
                            kandelr                  1            16             0              0
-----------------------------------------------------------------------------------------
bridge                                               1            16             0              0
                            dunhamsj                 1            16             0              0
-----------------------------------------------------------------------------------------
cgg                                                  0             0             1             64
                            liy110                   0             0             1             64
-----------------------------------------------------------------------------------------
cms                                                663          4582           306            702
                            cmslocal               391          1486           115            223
                            cmspilot               272          3096           191            479
-----------------------------------------------------------------------------------------
cqs_si                                               0             0             4              8
                            chenarsw                 0             0             4              8
-----------------------------------------------------------------------------------------
csb_sanders                                          3            75             0              0
                            lig7                     3            75             0              0
-----------------------------------------------------------------------------------------
davis_lab                                            0             0             1             16
                            bluejor                  0             0             1             16
-----------------------------------------------------------------------------------------
econ_faculty                                         1             1             0              0
                            moroa                    1             1             0              0
-----------------------------------------------------------------------------------------
hadjim_lab                                           2            16             0              0
                            reasosa2                 2            16             0              0
-----------------------------------------------------------------------------------------
h_biostat_student                                    7            40             0              0
                            yih4                     1            16             0              0
                            zhoun4                   6            24             0              0
-----------------------------------------------------------------------------------------
h_cqs                                                3            60             0              0
                            xuh14                    3            60             0              0
-----------------------------------------------------------------------------------------
h_darby_lab                                          3            24             0              0
                            leej133                  3            24             0              0
-----------------------------------------------------------------------------------------
h_vangard_1                                          1            15             0              0
                            chenh19                  1            15             0              0
-----------------------------------------------------------------------------------------
h_vmac                                               0             0             2              6
                            goodinrk                 0             0             2              6
-----------------------------------------------------------------------------------------
isde-rer                                             1             1             0              0
                            champaca                 1             1             0              0
-----------------------------------------------------------------------------------------
l2_jan_lab                                           2            17             1              1
                            davida7                  1            10             1              1
                            olivij1                  1             7             0              0
-----------------------------------------------------------------------------------------
l3_aboud_lab                                         1            64             0              0
                            hongm1                   1            64             0              0
-----------------------------------------------------------------------------------------
l3_precision_nutrition_lab                           1            52             0              0
                            baghem1                  1            52             0              0
-----------------------------------------------------------------------------------------
l3_runnoe_group                                      3            48             0              0
                            kaldorme                 3            48             0              0
-----------------------------------------------------------------------------------------
l3_vuiis_cci                                         1             2             0              0
                            vuiis_daily_s            1             2             0              0
-----------------------------------------------------------------------------------------
lea_lab                                              2            13             0              0
                            watowm1                  2            13             0              0
-----------------------------------------------------------------------------------------
leech_simulation                                    10           160          2860          45760
                            shij13                  10           160          2860          45760
-----------------------------------------------------------------------------------------
maiziezhou_lab                                       1            10             0              0
                            yuanw2                   1            10             0              0
-----------------------------------------------------------------------------------------
mchaourab                                            0             0             3              3
                            kaot1                    0             0             3              3
-----------------------------------------------------------------------------------------
mchaourab-csb                                        1           100             0              0
                            may19                    1           100             0              0
-----------------------------------------------------------------------------------------
mchs_compbio                                         1            16             0              0
                            riedlio                  1            16             0              0
-----------------------------------------------------------------------------------------
mcml                                                 0             0             2            192
                            odenyogg                 0             0             2            192
-----------------------------------------------------------------------------------------
nbody                                              175           658           237            591
                            ligo                   175           658           237            591
-----------------------------------------------------------------------------------------
p_csb_meiler                                      2091          2858         43899         199756
                            agarwm5                  0             0          3248           3248
                            huntek1               2032          2032         28185          28185
                            tydingcw                59           826         11989         167846
                            yange8                   0             0           477            477
-----------------------------------------------------------------------------------------
p_dsi                                                0             0            10             10
                            yangi1                   0             0            10             10
-----------------------------------------------------------------------------------------
p_englot_group                                       7           376             6            768
                            makhoug                  2           256             6            768
                            redaa1                   5           120             0              0
-----------------------------------------------------------------------------------------
p_matheny_lab                                       22           129             0              0
                            koolajd1                22           129             0              0
-----------------------------------------------------------------------------------------
p_meiler                                             0             0             1              1
                            yange8                   0             0             1              1
-----------------------------------------------------------------------------------------
rer                                                  1            16             0              0
                            hum6                     1            16             0              0
-----------------------------------------------------------------------------------------
r_isde                                               1             4             0              0
                            trippej1                 1             4             0              0
-----------------------------------------------------------------------------------------
rke_group                                           22            88             0              0
                            sleethmr                22            88             0              0
-----------------------------------------------------------------------------------------
rokaslab                                             2            20             0              0
                            danist                   1             4             0              0
                            sautet1                  1            16             0              0
-----------------------------------------------------------------------------------------
sarkar_lab                                           1            32             0              0
                            sarkah1                  1            32             0              0
-----------------------------------------------------------------------------------------
sbcs                                                 4            17             0              0
                            liq17                    1             2             0              0
                            yuanf1                   3            15             0              0
-----------------------------------------------------------------------------------------
stassun                                              1            60             1             60
                            medani                   1            60             1             60
-----------------------------------------------------------------------------------------
taylor_group                                         7            27             0              0
                            lambwg                   6            24             0              0
                            petrop3                  1             3             0              0
-----------------------------------------------------------------------------------------
walker_lab                                          27            27            96             96
                            fieldhm                 27            27            96             96
-----------------------------------------------------------------------------------------
wankowicz_lab                                      800           800         28965          28965
                            wankows                800           800         28965          28965
-----------------------------------------------------------------------------------------
womelsdorf_lab                                       1            10             0              0
                            gerritcg                 1            10             0              0
-----------------------------------------------------------------------------------------
yang_lab_csb                                         0             0            28            504
                            zhengm9                  0             0            28            504
-----------------------------------------------------------------------------------------
zhu_group                                            1            32             0              0
                            zhuw12                   1            32             0              0
-----------------------------------------------------------------------------------------
Totals:                                           3878         10516         76423         277503
Queue Summary (Batch GPU)
GROUP                       USER           ACTIVE_JOBS   ACTIVE_GPUS  PENDING_JOBS   PENDING_GPUS
-----------------------------------------------------------------------------------------
accre_guests_acc                                     1             1             0              0
                            liy110                   1             1             0              0
-----------------------------------------------------------------------------------------
cms_gpu_acc                                          3             3             0              0
                            uscmslocal               3             3             0              0
-----------------------------------------------------------------------------------------
csb_gpu_acc                                          6            16             0              0
                            karadim                  1             4             0              0
                            lybrantp                 1             1             0              0
                            ranx                     1             4             0              0
                            walkeas2                 1             1             0              0
                            zhengm9                  2             6             0              0
-----------------------------------------------------------------------------------------
maiziezhou_lab_acc                                  23            23           121            121
                            chenp12                 23            23           120            120
                            zhuy45                   0             0             1              1
-----------------------------------------------------------------------------------------
mchaourab_acc                                        1             4             3              3
                            kaot1                    0             0             3              3
                            wut18                    1             4             0              0
-----------------------------------------------------------------------------------------
nbody_acc                                            1             1             0              0
                            bustam1                  1             1             0              0
-----------------------------------------------------------------------------------------
p_dsi_acc                                            1             2             0              0
                            rajanb1                  1             2             0              0
-----------------------------------------------------------------------------------------
p_meiler_acc                                         0             0             1              1
                            scotj14                  0             0             1              1
-----------------------------------------------------------------------------------------
Totals:                                             36            50           125            125
Queue Summary (interactive)
GROUP                       USER           ACTIVE_JOBS  ACTIVE_CORES  PENDING_JOBS  PENDING_CORES
-----------------------------------------------------------------------------------------
edwards_lab_int                                      1             4             0              0
                            seaglehm                 1             4             0              0
-----------------------------------------------------------------------------------------
g_giri_group_int                                     1             4             0              0
                            breyem3                  1             4             0              0
-----------------------------------------------------------------------------------------
rubinov_lab_int                                      1            32             0              0
                            mohamb2                  1            32             0              0
-----------------------------------------------------------------------------------------
yang_lab_int                                         1             8             0              0
                            shaoq1                   1             8             0              0
-----------------------------------------------------------------------------------------
Totals:                                              4            48             0              0
Queue Summary (interactive_gpu)
GROUP                       USER           ACTIVE_JOBS   ACTIVE_GPUS  PENDING_JOBS   PENDING_GPUS
-----------------------------------------------------------------------------------------
dsi_dgx_iacc                                         3             9             5             12
                            deshmus                  0             0             1              1
                            donovcl1                 0             0             1              1
                            may19                    1             1             0              0
                            mohamb2                  0             0             2              4
                            wangr32                  1             6             1              6
                            wut18                    1             2             0              0
-----------------------------------------------------------------------------------------
Totals:                                              3             9             5             12
Partition Summary
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
interactive up 14-00:00:0 4 mix cn[1287,1301-1302,1804]
interactive up 14-00:00:0 23 idle cn[1322-1326,1328-1330,1707,1800-1803,1805-1814]
batch* up 14-00:00:0 1 drain* cn1370
batch* up 14-00:00:0 2 drng cn[1376,1397]
batch* up 14-00:00:0 2 drain cn[1586,1630]
batch* up 14-00:00:0 51 mix cn[1479,1486,1499,1506,1512-1513,1528-1529,1540,1545,1548-1553,1562-1565,1567,1569,1571,1573-1574,1578,1582,1584,1589,1594-1596,1603,1605,1607,1609,1612,1614-1616,1621,1624-1626,1629,1631-1633,1705-1706,1708]
batch* up 14-00:00:0 331 alloc cn[1202-1213,1215-1242,1257-1262,1264-1286,1288-1299,1303-1318,1320-1321,1327,1331-1355,1357-1369,1371-1375,1377-1385,1387-1396,1398-1412,1414-1427,1430-1432,1434-1443,1445-1450,1452-1458,1460-1464,1466-1478,1480-1485,1487-1498,1500-1505,1507-1511,1514-1520,1522-1527,1530-1538,1543-1544,1546-1547,1554-1559,1561,1566,1568,1570,1575-1577,1579-1581,1583,1585,1587-1588,1592-1593,1597,1602,1604,1608,1610,1613,1617-1620,1622-1623,1627-1628,1700-1703,2000]
batch* up 14-00:00:0 2 idle cn[1606,1704]
batch_gpu up 14-00:00:0 1 inval gpu0045
batch_gpu up 14-00:00:0 1 drain* gpu0057
batch_gpu up 14-00:00:0 3 drain gpu[0065-0066],hgx01
batch_gpu up 14-00:00:0 14 mix gpu[0013,0039,0042,0046,0049-0050,0053,0059,0062,0064,0067,0082,0300],hgx02
batch_gpu up 14-00:00:0 1 alloc gpu0071
batch_gpu up 14-00:00:0 41 idle gpu[0015,0017-0022,0026-0027,0033-0034,0060-0061,0063,0068-0070,0072-0081,0084-0085,0301-0310],gracehopper[01-02]
interactive_gpu up 14-00:00:0 1 resv dgx04
interactive_gpu up 14-00:00:0 3 mix dgx[01,03],gpu0058
interactive_gpu up 14-00:00:0 2 idle dgx02,gpu0207
sam up 2-02:00:00 1 alloc cms-sam-02
sam up 2-02:00:00 1 idle cms-sam-01
reserved up infinite 1 resv dgx04