-
-
May 16, 2023 at 11:17 pm, Alexandre Mongrain (Subscriber)
Hi,
I'm currently building a large model that simulates many contacts between beams (a cable analysis), and I have successfully simulated small cables. However, when I scale up my model (more wires and longer cables), I reach a point where ANSYS just crashes without any error. All contacts are split into pairs of beam regions that are likely to come into contact, so I have more than 100k real constant sets in my model.
I have a workstation with 512 GB of RAM, and I get crashes with models that require more than 200 GB for in-core mode. The crash almost always happens during the first iteration or just before it, and in rare cases after 5-6 iterations. I also tried forcing the out-of-core memory mode, but it also crashes, at the same place.
Here is an example of the last lines of the log file from a run that crashed:
DISTRIBUTED SPARSE MATRIX DIRECT SOLVER.
 Number of equations =   19535375,   Maximum wavefront =   243
 Process memory allocated for solver        =   11.275 GB
 Process memory required for in-core solution   =   66.652 GB
 Process memory required for out-of-core solution =   10.907 GB
 Total memory allocated for solver         =   86.199 GB
 Total memory required for in-core solution    =  463.498 GB
 Total memory required for out-of-core solution  =   83.486 GB
 *** WARNING ***             CP =   2165.953  TIME= 21:18:22
 The Distributed Sparse Matrix Solver is currently running in the
 out-of-core memory mode.  This memory mode may provide significantly
 worse performance compared to the in-core memory mode, depending on
 the amount of available system memory and I/O speed.  Please monitor
 the solver performance to ensure that the large amount of I/O to the
 solver files does not create a bottleneck for performance.
For reference, this analysis was using the distributed sparse matrix solver with 32 cores (the CPU is a Ryzen Threadripper PRO 5975WX).
I also tested shared-memory mode, and running with and without the GUI; none of this makes a difference.
I also occasionally get the following error, but not reproducibly:
*** ERROR ***                           CP =   3186.766   TIME= 13:57:02
An error has occurred in the Sparse Matrix Solver while factoring the
matrix. Error code = 154. Please send the data leading to this
operation to your technical support provider, as this will allow
ANSYS, Inc to improve the program.
Any thoughts?
Thanks!
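Editor's note: the memory summary in the log above can be compared against the installed RAM programmatically. The following is a minimal Python sketch, assuming the log lines keep exactly the wording shown in this post. The function names and the `headroom` safety factor are illustrative assumptions, not part of ANSYS or any Ansys guidance.

```python
import re

# Matches lines such as
# " Total memory required for in-core solution    =  463.498 GB"
MEMORY_LINE = re.compile(
    r"^\s*(Process|Total) memory (allocated for solver|required for "
    r"(?:in-core|out-of-core) solution)\s*=\s*([\d.]+)\s*GB",
    re.MULTILINE,
)


def parse_solver_memory(log_text):
    """Return {(scope, label): gigabytes} for each memory line found."""
    return {
        (scope.lower(), label): float(value)
        for scope, label, value in MEMORY_LINE.findall(log_text)
    }


def fits_in_core(memory, physical_ram_gb, headroom=0.9):
    """True if the total in-core requirement fits in a fraction of RAM.

    `headroom` is an arbitrary illustrative safety factor (assumption).
    """
    needed = memory[("total", "required for in-core solution")]
    return needed <= headroom * physical_ram_gb


# Memory summary copied from the log excerpt in this post.
SAMPLE_LOG = """\
 Process memory allocated for solver        =   11.275 GB
 Process memory required for in-core solution   =   66.652 GB
 Process memory required for out-of-core solution =   10.907 GB
 Total memory allocated for solver         =   86.199 GB
 Total memory required for in-core solution    =  463.498 GB
 Total memory required for out-of-core solution  =   83.486 GB
"""

memory = parse_solver_memory(SAMPLE_LOG)
for (scope, label), gigabytes in memory.items():
    print(f"{scope:8s}{label:40s}{gigabytes:9.3f} GB")
print("fits in core on 512 GB:", fits_in_core(memory, physical_ram_gb=512))
```

With the values from this log, the total in-core requirement (463.498 GB) sits close to the 512 GB installed, which is consistent with the solver reporting that it is running out-of-core. The check says nothing about the cause of the crash itself.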
-
May 17, 2023 at 2:08 pm, mrife (Ansys Employee)
Alexandre
Can you post the actual error reported in the "crash" that goes along with the first posted image (the standard memory allocation information screenshot)? I've seen similar issues on similar AMD CPUs, so try this: in the BIOS, turn off Simultaneous Multithreading, then redo the solve. What happens?
-
May 19, 2023 at 3:32 pm, Alexandre Mongrain (Subscriber)
I tried again after turning off the Simultaneous Multithreading option; it did not change anything. It crashed at the first iteration with no error.
-
-
May 18, 2023 at 5:22 pm, Alexandre Mongrain (Subscriber)
Thanks for your answer,
There is absolutely no error message when ANSYS crashes using the GUI; the log file ends with the memory allocation information and no accompanying error. Sometimes some information from a first completed iteration is also written to the log file, and the program then hangs for an hour or two before closing itself. When I start my job from a command line, I get the following on the console when it crashes, but I think it is only a generic message:
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 0 PID 19180 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 1 PID 12748 RUNNING AT MEC053695
=   EXIT STATUS -1073741819 (c0000005)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 2 PID 13072 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 3 PID 18524 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 4 PID 12372 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 5 PID 18528 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 6 PID 19092 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 7 PID 18168 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
===================================================================================
I will try to change the BIOS setting tomorrow and re-test, and I will come back with the results.
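Editor's note: the console output above is not entirely generic. Rank 1 exits with a different status from the other ranks, and on Windows the value c0000005 is the NTSTATUS code STATUS_ACCESS_VIOLATION. The following is a minimal Python sketch for decoding such statuses, assuming the console text has the layout shown in this post. The function names and the small lookup table are illustrative, and the table is deliberately incomplete.

```python
import re

# A few well-known Windows NTSTATUS values (deliberately tiny table).
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC00000FD: "STATUS_STACK_OVERFLOW",
    0xC0000017: "STATUS_NO_MEMORY",
}

# Captures rank, PID and the signed decimal exit status of each block.
RANK_BLOCK = re.compile(
    r"RANK\s+(\d+)\s+PID\s+(\d+).*?EXIT STATUS\s+(-?\d+)", re.DOTALL
)


def decode_exit_status(status):
    """Map a signed 32-bit exit status to (unsigned code, symbolic name)."""
    code = status & 0xFFFFFFFF  # reinterpret as unsigned 32-bit
    return code, KNOWN_NTSTATUS.get(code, "unknown")


def summarize(console_text):
    """Return one (rank, pid, hex code, name) tuple per terminated rank."""
    rows = []
    for rank, pid, status in RANK_BLOCK.findall(console_text):
        code, name = decode_exit_status(int(status))
        rows.append((int(rank), int(pid), f"0x{code:08X}", name))
    return rows


# First two blocks copied from the console output in this post.
SAMPLE_CONSOLE = """\
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 0 PID 19180 RUNNING AT MEC053695
=   EXIT STATUS -1 (ffffffff)
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   RANK 1 PID 12748 RUNNING AT MEC053695
=   EXIT STATUS -1073741819 (c0000005)
"""

for row in summarize(SAMPLE_CONSOLE):
    print(row)
```

One plausible reading, not confirmed anywhere in this thread, is that rank 1 hit the access violation first and the remaining ranks were then terminated with status -1 as a consequence.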
-
May 19, 2023 at 3:48 pm, mrife (Ansys Employee)
What OS and version are you using? Are you submitting to a job scheduler, and if so, do you have the scheduler log for the job?
-
May 19, 2023 at 3:54 pm, Alexandre Mongrain (Subscriber)
I'm using a Windows workstation with ANSYS 2022R2, and I'm submitting my job directly from a command prompt.
With the -o option I get a log that, when the process crashes, ends either with the memory information or with some information from the first iteration.
Other than that, I get the "BAD TERMINATION" messages shown in my previous reply in the command prompt.
-
-
May 19, 2023 at 4:49 pm, mrife (Ansys Employee)
Alexandre - are you an Ansys customer, or are you using an Academic license? We are getting to the point of needing the input file (as stated in the error message you posted). Before that, however: is there any CPU or RAM overclocking set on the system? Can you try another application, such as Prime95, to stress-test the system?
https://en.wikipedia.org/wiki/Prime95
https://www.mersenne.org/download/#stresstest
-
May 19, 2023 at 5:41 pm, Alexandre Mongrain (Subscriber)
I use an academic research license; I'm using ANSYS for my master's project. The CPU and RAM clocks are at factory defaults.
I will try to stress-test the system with Prime95. I'm not convinced it is a hardware problem, though: I have successfully run other big models (not cables) on this machine. I suspect it may have to do with the very large number of real constant sets (one for each contact pair), or something similar.
I can share an input file, just to check whether it runs successfully on another machine. Can I send it to you directly? However, I really don't expect my model to converge as it is.
-
-
May 22, 2023 at 1:59 pm, mrife (Ansys Employee)
Alexandre - which institution do you attend?
-
May 23, 2023 at 2:18 pm, Alexandre Mongrain (Subscriber)
I attend ÉTS in Québec, Canada.
-
-
May 23, 2023 at 2:39 pm, mrife (Ansys Employee)
Alexandre - which department?
-
May 23, 2023 at 2:41 pm, Alexandre Mongrain (Subscriber)
The Mechanical Engineering department.
-
-
May 23, 2023 at 2:55 pm, mrife (Ansys Employee)
Alexandre - I reached out to the ASC (Ansys Support Coordinator) for that department, asking them to forward you an email. If they do forward the email, please reply to me directly.
Mike
-
- The topic ‘ANSYS APDL crash with large contact analysis’ is closed to new replies.