<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/">
	<channel>
		<title>RCAC - Outages and Maintenance, Outages, Maintenance</title>
		<link>https://db.rcac.purdue.edu/news/rss/Outages%20and%20Maintenance</link>
		<description><![CDATA[news::news.feed description]]></description>
		<atom:link href="https://db.rcac.purdue.edu/news/rss/Outages%20and%20Maintenance" rel="self" type="application/rss+xml" />
		<language>en</language>
		<lastBuildDate>Tue, 05 May 2026 21:57:14 EDT</lastBuildDate>
					<item>
				<title><![CDATA[Scheduled Bell Maintenance – May 18]]></title>
				<link>https://db.rcac.purdue.edu/news/7675</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7675</guid>
				<description><![CDATA[<p>Bell will be undergoing planned maintenance and a minor operating system upgrade on Monday, May 18. This work will also affect the Bell scratch filesystem.</p>
<ul>
<li>Maintenance work will start around 8:00 AM and should end by 5:00 PM Monday, May 18.</li>
<li>A reservation is in place starting at 3:00 AM on May 18 so new jobs do not run into the maintenance window.</li>
<li>Jobs that would run during the maintenance period should be scheduled for another time.</li>
<li>We will send out an update when maintenance is complete or we have more information to share.</li>
</ul>
<p>If you have any questions or concerns about this maintenance, please contact us at <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a>.</p>
]]></description>
				<pubDate>Mon, 18 May 2026 08:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://db.rcac.purdue.edu/news/7674</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7674</guid>
				<description><![CDATA[<p>The Purdue Github services will be unavailable Wednesday, May 6, 2026 from 3:00pm - 5:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 06 May 2026 15:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Fortress Archive Monthly Maintenance]]></title>
				<link>https://db.rcac.purdue.edu/news/7671</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7671</guid>
				<description><![CDATA[<p>The Fortress Archive will be unavailable Wednesday, May 6, 2026 at 8:00am EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, Fortress will receive normal software and hardware updates. Any transfers which request files sent to or from Fortress will either block or fail until this maintenance is completed.</p>
]]></description>
				<pubDate>Wed, 06 May 2026 08:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Kernel Upgrade on Clusters Nodes]]></title>
				<link>https://db.rcac.purdue.edu/news/7670</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7670</guid>
				<description><![CDATA[<p>We are performing rolling reboots on all login and compute nodes for critical kernel upgrades. Nodes will reboot in stages, so resources remain available.</p>
]]></description>
				<pubDate>Thu, 30 Apr 2026 09:30:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Globus issue affecting Purdue “Data Depot” endpoint]]></title>
				<link>https://db.rcac.purdue.edu/news/7669</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7669</guid>
				<description><![CDATA[<p>We are currently investigating problems with the Purdue Research Computing – Data Depot Globus endpoint that cause directory listings and transfers to time out with errors such as <code>ExternalError.DirListingFailed.Timeout</code>.</p>
<p>Users may be unable to browse directories or start new transfers involving this endpoint.  There is no workaround at this time other than retrying later or using an alternate endpoint when possible.  Our next update will be posted on or before 11:00am Thursday 4/30/26 or sooner.</p>
]]></description>
				<pubDate>Wed, 29 Apr 2026 11:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Gilbreth Cluster Open OnDemand Maintenance (April 28)]]></title>
				<link>https://db.rcac.purdue.edu/news/7665</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7665</guid>
				<description><![CDATA[<p>The Open Ondemand service for Gilbreth will be unavailable <strong>from Tuesday, April 28 at 9:00am EDT, 2026 to Tuesday, April 28 at 5:00pm EDT, 2026.</strong> During the maintenance, RCAC team will perform a reconfiguration to the Open Ondemand dashboard for Gilbreth which include a brand new design of the dashboard with new features listed below.</p>
<h3>What’s New on the dashboard?</h3>
<ul>
<li>
<strong>GPU Usage:</strong> Monitor your group usages and remaining available GPUs on Gilbreth.</li>
<li>
<strong>Disk Usage:</strong> Monitor your storage utilization across Gilbreth’s file systems.</li>
<li>
<strong>Job Queue:</strong> View and manage your running and queued jobs on Gilbreth.</li>
<li>
<strong>News Feed:</strong> Stay updated with the latest Gilbreth news, outages and announcements.</li>
<li>
<strong>Partition Status:</strong> Monitor the current state of partitions/queues on Gilbreth.</li>
<li>
<strong>My Jobs Page:</strong> Re-designed page to show detailed job information for your jobs and jobs in your group(s) as well as job management.</li>
<li>
<strong>Performance Metrics Page:</strong> Analyze your job performance and resource utilization patterns over time.</li>
</ul>
<h3>What will impact you?</h3>
<ul>
<li>All Slurm jobs on Gilbreth (including jobs that have already submitted through Open Ondemand before this maintenance) will continue and <strong>NOT</strong> be impacted.</li>
<li>All functions related to Open Ondemand including login  will be unavailable during the maintenance.</li>
</ul>
<p>Gilbreth Open Ondemand service will return to full production by <strong>Tuesday, April 28 at 5:00pm EDT, 2026</strong>.</p>
<p>Please submit a ticket through RCAC Help Desk <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a> if you have any questions or suggestions.</p>
]]></description>
				<pubDate>Tue, 28 Apr 2026 08:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[All Clusters & Services Restored]]></title>
				<link>https://db.rcac.purdue.edu/news/7668</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7668</guid>
				<description><![CDATA[<p>At approximately 9:40pm EDT Monday, April 27th, 2026, a campus-wide power outage due to weather impacted all research computing clusters and storage systems. Power has been restored, and engineers are currently bringing systems back online in an orderly fashion. Access to various systems has been paused as this is being addressed.</p>
<p>We will provide an update by noon, April 28 or sooner as work progresses.</p>
]]></description>
				<pubDate>Mon, 27 Apr 2026 21:40:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[RCAC Maintenance Complete – All Systems Available]]></title>
				<link>https://db.rcac.purdue.edu/news/7666</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7666</guid>
				<description><![CDATA[<p>The ongoing RCAC maintenance originally scheduled to conclude at 5:00 PM today (Thursday, April 23) has been extended. The new expected completion time is 9:00 PM tonight, Thursday, April 23.</p>
<p>All RCAC systems and Research Network services will remain unavailable during this extended window.</p>
<p>We appreciate your continued patience as we complete these critical upgrades to improve RCAC infrastructure reliability and performance.</p>
<p>For assistance, questions, or concerns, please contact <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a>.</p>
]]></description>
				<pubDate>Wed, 22 Apr 2026 06:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Data Depot is up, but some operations may be slower than normal while recovery work continues]]></title>
				<link>https://db.rcac.purdue.edu/news/7662</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7662</guid>
				<description><![CDATA[<p>Data Depot is currently experiencing degraded performance. Service is available, but background recovery processes are still running and may slow access to Depot. Our team is continuing to monitor the system and work toward full restoration.</p>
]]></description>
				<pubDate>Tue, 14 Apr 2026 13:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Data Depot is currently unavailable]]></title>
				<link>https://db.rcac.purdue.edu/news/7660</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7660</guid>
				<description><![CDATA[<p>We are currently experiencing an unplanned outage affecting Data Depot. Engineers are actively working to restore service, but there is no ETA to share yet. We will provide updates as soon as more information is available.</p>
]]></description>
				<pubDate>Tue, 14 Apr 2026 09:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://db.rcac.purdue.edu/news/7652</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7652</guid>
				<description><![CDATA[<p>The Purdue Github services will be unavailable Wednesday, April 1, 2026 from 3:00pm - 5:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 01 Apr 2026 15:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Fortress Archive Monthly Maintenance]]></title>
				<link>https://db.rcac.purdue.edu/news/7648</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7648</guid>
				<description><![CDATA[<p>The Fortress Archive will be unavailable Wednesday, April 1, 2026 from 8:00am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, Fortress will receive normal software and hardware updates. Any transfers which request files sent to or from Fortress will either block or fail until this maintenance is completed.</p>
]]></description>
				<pubDate>Wed, 01 Apr 2026 08:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi Cluster Filesystem Interruption (Resolved)]]></title>
				<link>https://db.rcac.purdue.edu/news/7651</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7651</guid>
				<description><![CDATA[<p>Between approximately 9:30 PM and 10:45 PM EDT on March 30, 2026, the Negishi home file system experienced issues that prevented users from successfully connecting to login nodes.</p>
<p>Service was fully restored at 10:45 PM EDT, and login functionality has returned to normal. No data loss occurred.</p>
]]></description>
				<pubDate>Mon, 30 Mar 2026 21:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Temporary Impact to Data Depot Operations]]></title>
				<link>https://db.rcac.purdue.edu/news/7650</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7650</guid>
				<description><![CDATA[<p>We’re currently investigating an issue affecting IO operations to the Data Depot. Our monitoring has detected that the primary storage disks used for data writes are nearing capacity, which is causing IO failures for jobs writing to the Depot.</p>
<p>To prevent further impact, job submissions to clusters connected to Depot have been temporarily paused while the team works restore normal operation.</p>
<p><strong>Impact:</strong> Users may experience job delays or failures when writing to the Data Depot.</p>
<p><strong>Current Status:</strong> Our engineering team is actively addressing the capacity issue.</p>
<p><strong>Next Update:</strong> We will provide an update once the service is restored or we have more information to share.</p>
]]></description>
				<pubDate>Mon, 30 Mar 2026 09:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Weber Cluster Maintenance]]></title>
				<link>https://db.rcac.purdue.edu/news/7636</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7636</guid>
				<description><![CDATA[<p>As of 11:00am EDT, engineers have completed maintenance and have returned the Weber cluster back to normal service. Please report any issues to <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a></p>
<p>The Weber cluster will be unavailable Wednesday, March 18, 2026 from 8:00am - 5:00pm EDT for scheduled maintenance. The cluster will return to full production by 5:00pm EDT.</p>
<p>During this time, Weber will have its operating system patched.</p>
<p>Any Slurm jobs which request a walltime which would take them past Wednesday, March 18, 2026 at 8:00am EDT will not start and will remain in the queue until after the maintenance is completed.</p>
]]></description>
				<pubDate>Wed, 18 Mar 2026 08:00:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Power Outage Impacting Multiple Clusters — Recovery Underway]]></title>
				<link>https://db.rcac.purdue.edu/news/7640</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7640</guid>
				<description><![CDATA[<p>At approximately 6:00 AM EDT, a power outage impacted systems in the Math Data Center. Most services have now been restored.</p>
<p>Due to the outage, some jobs on Gilbreth did not requeue automatically. Users should check the status of any jobs that were running early this morning and resubmit them if needed.</p>
]]></description>
				<pubDate>Wed, 18 Mar 2026 06:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Negishi experiencing 2nd service disruption]]></title>
				<link>https://db.rcac.purdue.edu/news/7639</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7639</guid>
				<description><![CDATA[<p>We are again investigating an issue affecting Negishi. The system is currently unresponsive or unavailable for some users, including SSH access. We will provide an update when service is restored or as soon as we have additional information.</p>
]]></description>
				<pubDate>Tue, 17 Mar 2026 14:30:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi experiencing service disruption]]></title>
				<link>https://db.rcac.purdue.edu/news/7638</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7638</guid>
				<description><![CDATA[<p>We are investigating an issue affecting Negishi. The system is currently unresponsive or unavailable for some users, including SSH access. We will provide an update when service is restored or as soon as we have additional information.</p>
]]></description>
				<pubDate>Tue, 17 Mar 2026 11:00:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi Cluster Service Interruption]]></title>
				<link>https://db.rcac.purdue.edu/news/7629</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7629</guid>
				<description><![CDATA[<p>We are currently experiencing an unexpected outage affecting the Negishi cluster. We are actively working to restore service and will provide an update once the system is stable.</p>
<ul>
<li>
<p><strong>Impact:</strong> Users are currently unable to access their home directories, and SSH connections or active terminal sessions may freeze or be denied.</p>
</li>
<li>
<p><strong>Current action:</strong> System administrators are performing a system reboot of the affected storage infrastructure to clear the error and restore file access.</p>
</li>
</ul>
]]></description>
				<pubDate>Wed, 11 Mar 2026 15:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Weber Authentication Changes]]></title>
				<link>https://db.rcac.purdue.edu/news/7622</link>
				<guid isPermaLink="true">https://db.rcac.purdue.edu/news/7622</guid>
				<description><![CDATA[<p>As part of the efforts to certify Purdue as a CMMC Level 2 compliant institution, authentication to Weber, ThinLlinc access, and data egress will change during a maintenance scheduled for 8 AM -12 PM on March 5th, 2026.</p>
<p>** How does this change impact you? **</p>
<p>Login to Weber: The change will simplify logging into Weber. Weber users currently need to authenticate to Luna VPN or VDI with Luna credentials, and then additionally authenticate to Weber with BoilerAD credentials. After the change, users will just need to use their Luna password when logging into Weber resources, including via ThinlLinc, SMB, and SSH. Luna password will be the same password as is currently used to log into the VPN.</p>
<p>ThinlLinc: Anyone accessing Weber via ThinLinc directly from their endpoint will need to switch to using ThinLinc from a Luna Desktop.</p>
<p>Data Egress: File egress will be disabled, and users will need to submit a ticket each time they wish to transfer files outside weber. Egress to Apricon drives will no longer be supported on regular endpoints.</p>
<p>During the maintenance window, job scheduling though Slurm and network drive mounts of Weber storage will be unavailable.</p>
<p>These changes position Purdue to meet the requirements of CMMC Level 2 compliance and will be crucial for continuation of various grants and contracts.</p>
<p>Please reach out to <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a>? if you have any questions.</p>
]]></description>
				<pubDate>Thu, 05 Mar 2026 08:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
			</channel>
</rss>