Minimal changes

lorenzotomada · lorenzotomada · commit 77b750ba923d · 2025-07-01T17:02:32.000+02:00
diff --git a/README.md b/README.md
@@ -12,7 +12,7 @@ This is done following an efficient strategy specialized for symmetric matrices,
 The implementation of the solver is done using `mpi4py`. Moreover, the package relies on a `C++` backend that is automatically compiled when running `python -m pip install .`.
 A more detailed discussion on dependencies and on how to install the package is provided at the end of the `README.md` file.
 ## Repo structure
-We implemented various GitHub workflows, which include unit testing, documentation generation and code formatting.
+We implemented various `GitHub` workflows, which include unit testing, documentation generation and code formatting.
 
 1. Unit tests are performed using `pytest`. They are run automatically after each push. There are three test files in the `test` folder, namely `test_eigensolvers.py` (using to test the implementation of the Lanczos method and the QR algorithm), `test_zero_finder.py` (used to ensure correctness of helper functions for the divide et impera algorithm), and `test_utils.py` (to test that some helper functions work as expected).
 2. All the code is commented in detail in terms of docstrings and comments corresponding to the most salient lines of code. The documentation is generated automatially using `sphinx` at each push and deployed to `GitHub` pages.
@@ -31,12 +31,17 @@ In order to solve an eigenvalue problem, we considered multiple strategies.
 1. The most trivial one was to implement the power method in order to be able to compute (at least) the biggest eigenvalue. We then used `numba` to try and optimize it, but in this case just-in-time compilation was not extremely beneficial.The implementation of the power method is contained in `eigenvalues.py`.
 2. Lanczos + QR: this is an approach (tailored to the case of symmetric matrices) to compute *all* the eigenvalues and eigenvectors. Notice that, also in the case of the QR method,`numba` was not very beneficial in terms of speed-up, resulting in a pretty slow methodology. For this reason, we implemented the QR method in `C++` and used `pybind11` to expose it to `Python`. All the code written in `C++` can be found in `cxx_utils.cpp`.
 3. `CuPy` implementation of all of the above: we implemented all the above methodologies using `CuPy` to see whether using GPU could speed up computations. Since this was not the case, we commented all the lines of code involving `CuPy`, so that installation of the package is no longer required and we can use our code also on machines that do not have GPU.
-4. The core of the project is the implementation (as well as a generalization of the simplified case in which $\rho=1$ considered in our reference) of the _divide et implera_ method for the computation of eigenvalues of a symmetric matrix. Some helpers were originally written in `Python` and then translated to `C++` for efficiency reasons: their original implementation is in `zero_finder.py` and is still present in the project for testing purposes. The translated version can be found in `cxx_utils.cpp`. Instead, the implementation of the actual method to compute the eigenvalues starting from a tridiagonal matrix is contained in `parallel_tridiag_eigen.py` and makes use of `mpi4py`. Notice that the implementation of deflation in `cxx_utils.cpp` is done using the `Eigen` library.
+4. The core of the project is the implementation (as well as a generalization of the simplified case in which $\rho=1$ considered in our reference) of the _divide et impera_ method for the computation of eigenvalues of a symmetric matrix. Some helpers were originally written in `Python` and then translated to `C++` for efficiency reasons: their original implementation is in `zero_finder.py` and is still present in the project for testing purposes. The translated version can be found in `cxx_utils.cpp`. Instead, the implementation of the actual method to compute the eigenvalues starting from a tridiagonal matrix is contained in `parallel_tridiag_eigen.py` and makes use of `mpi4py`. Notice that the implementation of deflation in `cxx_utils.cpp` is done using the `Eigen` library.
 
 # Results
 The results of the profiling (runtime vs matrix size, memory consumption, scalability, and so on) are discussed in detail in `Documentation.ipynb`.
 All the scripts in the `scripts` folder are either used for profiling or to provide running examples.
 
+## Important remark
+The method that we implemented was tested thoroughly at all stages of development using `pytest`.
+Nevertheless, the algorithm that we chose seems to lack robustness, meaning that there exist some matrices for which the results are not accurate (even though most of the times they are).
+We are convinced that this issue is related to stability issues, as is fairly common in numerical linear algebra.
+
 # How to run
 We provide an example of running code in the `script` folder.
 Assuming that you are in the root folder of the project, it sufficies to use
@@ -58,6 +63,14 @@ Notice, however, that due to Ulysse's problems with `MPI` the profiling for
 As a result, we also provide `submit.sh`, which is supposed to be run on a workstation.
 It executes `mpirun -np [n_procs] python scripts/profile_memory.py`, basically doing the same as the `submit.sbatch` script, but without using `SLURM`.
 Notice that it assumes that `shell/load_modules.sh` has already been executed (see the next section).
+Examples:
+```bash
+sbatch shell/subsmit.sbatch
+```
+and
+```bash
+./shell/submit.sh
+```
 
 We also remark that the script to perform memory profiling `scripts/profile_memory.py` does not spam an `MPI` communicator, but is supposed to be called using `mpirun`. The reason for that is to provide a more extensive list of examples of how our package can be used.
 
diff --git a/experiments/config.yaml b/experiments/config.yaml
@@ -1,4 +1,4 @@
-dim: 500
+dim: 200
 density: 0.1
 n_processes: 2
 plot: false
diff --git a/scripts/mpi_running.py b/scripts/mpi_running.py
@@ -6,11 +6,11 @@
 import sys
 import numpy as np
 import argparse
-from pyclassify.utils import read_config, poisson_2d_structure, make_symmetric
+from pyclassify.utils import read_config, make_symmetric
 from pyclassify.eigenvalues import Lanczos_PRO
 
 
-seed = 8422
+seed = 84
 np.random.seed(seed)
 
 
@@ -59,11 +59,6 @@ def compute_eigvals(A, n_procs):
 density = kwargs["density"]
 n_procs = kwargs["n_processes"]
 
-# You could use (for low values of dim, else accuracy suffers):
-# A = poisson_2d_structure(dim)
-# A_np = A.toarray()
-
-# Alternatively, consider for instance:
 eig = np.arange(1, dim + 1)
 A = np.diag(eig)
 U = scipy.stats.ortho_group.rvs(dim)
diff --git a/scripts/profiling_memory.py b/scripts/profiling_memory.py
@@ -4,7 +4,6 @@
     make_symmetric,
     profile_numpy_eigvals,
     profile_scipy_eigvals,
-    poisson_2d_structure,
 )
 from pyclassify.parallel_tridiag_eigen import parallel_tridiag_eigen
 
@@ -20,7 +19,7 @@
 from mpi4py import MPI
 
 
-seed = 8422
+seed = 84
 np.random.seed(seed)
 
 
@@ -53,8 +52,13 @@
 # Now we build the matrix on rank 0
 # It is a scipy sparse matrix with the structure of a 2D Poisson problem matrix obtained using finite differences
 if rank == 0:
-    A = poisson_2d_structure(dim)
-    A_np = A.toarray()
+    eig = np.arange(1, dim + 1)
+    A = np.diag(eig)
+    U = scipy.stats.ortho_group.rvs(dim)
+
+    A = U @ A @ U.T
+    A = make_symmetric(A)
+    A_np = A
 else:
     A_np = None
 
diff --git a/scripts/run.py b/scripts/run.py
@@ -35,7 +35,7 @@
 
 t_s = time()
 eigvals, eigvecs = parallel_tridiag_eigen(
-    main_diag, off_diag, comm=child_comm, min_size=1, tol_factor=1e-10
+    main_diag, off_diag, comm=child_comm, min_size=1, tol_factor=1e-14
 )
 t_e = time()
 

Original file line number	Diff line number	Diff line change
`@@ -35,7 +35,7 @@`
`35`	`35`
`36`	`36`	`t_s = time()`
`37`	`37`	`eigvals, eigvecs = parallel_tridiag_eigen(`
`38`		`- main_diag, off_diag, comm=child_comm, min_size=1, tol_factor=1e-10`
	`38`	`+ main_diag, off_diag, comm=child_comm, min_size=1, tol_factor=1e-14`
`39`	`39`	`)`
`40`	`40`	`t_e = time()`
`41`	`41`