LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Dilani Perera (dilani_at_[hidden])
Date: 2005-12-13 09:41:28


Hi,

My program works fine for upto 12 processors. But when its more than that
it gives the following error. What can be the reason ?

nuthatch% mpirun -v -np 14 /users/cs/grad/dilani/Research_2005/out A2000.txt
5074 /users/cs/grad/dilani/Research_2005/out running on n0 (o)
31791 /users/cs/grad/dilani/Research_2005/out running on n1
30726 /users/cs/grad/dilani/Research_2005/out running on n2
31488 /users/cs/grad/dilani/Research_2005/out running on n3
30166 /users/cs/grad/dilani/Research_2005/out running on n4
30569 /users/cs/grad/dilani/Research_2005/out running on n5
30479 /users/cs/grad/dilani/Research_2005/out running on n6
30330 /users/cs/grad/dilani/Research_2005/out running on n7
30218 /users/cs/grad/dilani/Research_2005/out running on n8
30695 /users/cs/grad/dilani/Research_2005/out running on n9
31426 /users/cs/grad/dilani/Research_2005/out running on n10
30978 /users/cs/grad/dilani/Research_2005/out running on n11
16466 /users/cs/grad/dilani/Research_2005/out running on n12
16145 /users/cs/grad/dilani/Research_2005/out running on n13
size :2000,
MPI_Recv: process in local group is dead (rank 5, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 6, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 8, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 9, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 4, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 3, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 7, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 10, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 2, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 11, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 12, MPI_COMM_WORLD)
MPI_Recv: process in local group is dead (rank 13, MPI_COMM_WORLD)
Rank (5, MPI_COMM_WORLD): Call stack within LAM:
Rank (6, MPI_COMM_WORLD): Call stack within LAM:
Rank (8, MPI_COMM_WORLD): Call stack within LAM:
Rank (9, MPI_COMM_WORLD): Call stack within LAM:
Rank (4, MPI_COMM_WORLD): Call stack within LAM:
Rank (3, MPI_COMM_WORLD): Call stack within LAM:
Rank (7, MPI_COMM_WORLD): Call stack within LAM:
Rank (10, MPI_COMM_WORLD): Call stack within LAM:
Rank (2, MPI_COMM_WORLD): Call stack within LAM:
Rank (11, MPI_COMM_WORLD): Call stack within LAM:
Rank (5, MPI_COMM_WORLD): - MPI_Recv()
Rank (6, MPI_COMM_WORLD): - MPI_Recv()
Rank (8, MPI_COMM_WORLD): - MPI_Recv()
Rank (12, MPI_COMM_WORLD): Call stack within LAM:
Rank (9, MPI_COMM_WORLD): - MPI_Recv()
Rank (4, MPI_COMM_WORLD): - MPI_Recv()
Rank (3, MPI_COMM_WORLD): - MPI_Recv()
Rank (13, MPI_COMM_WORLD): Call stack within LAM:
Rank (7, MPI_COMM_WORLD): - MPI_Recv()
Rank (10, MPI_COMM_WORLD): - MPI_Recv()
Rank (2, MPI_COMM_WORLD): - MPI_Recv()
Rank (11, MPI_COMM_WORLD): - MPI_Recv()
Rank (12, MPI_COMM_WORLD): - MPI_Recv()
Rank (13, MPI_COMM_WORLD): - MPI_Recv()
Rank (5, MPI_COMM_WORLD): - main()
Rank (6, MPI_COMM_WORLD): - main()
Rank (8, MPI_COMM_WORLD): - main()
Rank (9, MPI_COMM_WORLD): - main()
Rank (4, MPI_COMM_WORLD): - main()
Rank (3, MPI_COMM_WORLD): - main()
Rank (7, MPI_COMM_WORLD): - main()
Rank (10, MPI_COMM_WORLD): - main()
Rank (2, MPI_COMM_WORLD): - main()
Rank (11, MPI_COMM_WORLD): - main()
Rank (12, MPI_COMM_WORLD): - main()
Rank (13, MPI_COMM_WORLD): - main()
-----------------------------------------------------------------------------
One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.

PID 5074 failed on node n0 (134.153.50.235) due to signal 15.
-----------------------------------------------------------------------------
nuthatch% size :2000,
size: ':2000,': No such file
nuthatch% cs/grad/dilani/Research_2005/out A2000.txt
   <
/bin/ksh: nuthatch%: not found
nuthatch% 5074 /users/cs/grad/dilani/Research_2005/out running on n0 (o)
/bin/ksh: syntax error: `(' unexpected
nuthatch% 31791 /users/cs/grad/dilani/Research_2005/out running on n1
/bin/ksh: 31791: not found
nuthatch% 30726 /users/cs/grad/dilani/Research_2005/out running on n2
/bin/ksh: 30726: not found
nuthatch% 31488 /users/cs/grad/dilani/Research_2005/out running on n3
/bin/ksh: 31488: not found
nuthatch% 30166 /users/cs/grad/dilani/Research_2005/out running on n4
/bin/ksh: 30166: not found
nuthatch% 30569 /users/cs/grad/dilani/Research_2005/out running on n5
/bin/ksh: 30569: not found
nuthatch% 30479 /users/cs/grad/dilani/Research_2005/out running on n6
/bin/ksh: 30479: not found
nuthatch% 30330

Thanks.

Dilani Perera.
(MSC Candidate for Computational Sciences)
Department of Computer Science,
St. John's, NL
Canada,A1B 3X5
Tel: 709-737-6142 (office)

email : dilani_at_[hidden]
Visit me at : www.cs.mun.ca/~dilani