ABINIT. List of features. 
This file describes the major features of the ABINIT package.
Copyright (C) 1998-2016 ABINIT group (DCA,XG)
 This file is distributed under the terms of the GNU General Public License, see
~abinit/COPYING or 
http://www.gnu.org/copyleft/gpl.txt .
 For the initials of contributors, see ~abinit/doc/developers/contributors.txt .
Contents: 
 - 0. Related documentation.
 
- 1. Available physical properties.
  
   - 1.A. Total energy of an assembly of nuclei and electrons placed in a repeated cell.
- 1.B. Derivatives of the total energy and eigenenergies (forces, electric field, phonons, ...).
- 1.C. Excited states and other electronic properties.
- 1.D. Displacement of atoms, and changes of cell parameters.
- 1.E. Properties at the nuclei sites.
- 1.F. Analysis and graphical tools.
- 1.G. Interfacing with other codes.
- 1.H. Verification (cross checking with other codes).
 
- 2. Speed and memory.
  
   - 2.A. Speed in the sequential version.
- 2.B. Speed in the parallel version.
- 2.C. Memory.
 
- 3. The user's point of view.
  
   - 3.A. The Web site.
- 3.B. Portability.
- 3.C. Running jobs, input and output files.
- 3.D. Documentation.
- 3.E. Generation of the k-points, geometries, and starting wavefunctions.
- 3.F. Automatic determination of input parameters.
 
- 4. The programmer's point of view.
0. Related documentation
Besides the documentation available on the Web,
there is a wealth of information inside the package itself, see the
directory "doc".
The reader might consult the latest version of the "doc/users/context.txt" file
for the description of the ABINIT project and its history.
The different versions of the "doc/release_notes/*html" files will allow to
see the actual development of the project since version 1.5,
released in August 1998. They give also an idea of on-going developments.
Discussions on the forum might also 
allow to get documentation.
1. Available physical properties
1.A. Computation of the total energy of an assembly of nuclei and electrons
placed in a repeated cell.
- 
1.A.1. The computation can be done using (1) plane waves and norm-conserving
pseudopotentials, or, (2) so-called
"Projector-Augmented Waves" (PAW method), with appropriate pseudoatomic data,
or (3) wavelets, see 1.A.14.
- 
1.A.2. The total energy computation is done according to the Density Functional
Theory (also +U and DMFT, see 1.A.15 and 1.A.20). 
Many exchange-correlation functionals are available,
through two different implementations : one is the native ABINIT implementation,
the other is the ETSF library of XC functionals (LibXC is a plug-in to ABINIT).
In the native ABINIT set,
most of the important local approximations (LDA) are
available, including the Perdew-Zunger one. Two different local spin density
(LSD) are available, including the Perdew Wang 92, and one due to M. Teter.
The Perdew-Burke-Ernzerhof,  the revPBE, the RPBE
and the HCTH GGAs (spin unpolarized as well as polarized)
are also available.
In the LibXC 2.0 library, as interfaced with ABINIT,
there are 24 functional forms of the 3D LDA type, and over 80 functional forms of the GGA type.
They can be used with norm-conserving pseudopotentials as well as PAW atomic data.
Also, some metaGGA can be used with ABINIT (norm-conserving case only).
For response-function type calculations, the native ABINIT LDA and GGA kernels 
can be used as well as the LibXC ones.
The exchange can also be computed on the basis of the Fock expression (exact exchange), and the correlation
can be computed on the basis of the RPA approximation (see the GW section).
Hybrid functionals calculations (HSE06, PBE0, B3LYP) can be performed.
The implementation of the exact exchange, correlation and hybrid does not deliver the forces and stresses at present,
at the exception of forces with norm-conserving pseudopotentials.
Several Van Der Waals functionals are available : Grimme (D2, D3, D3(Becke-Johnson)), Silvestrelli.
- 
1.A.3. Self-consistent calculations will generate the DFT ground-state,
with associated energy and density. Afterwards, a non-self-consistent
calculation might generate eigenenergies at a large number
of k-points, for band structures. The electronic Density-Of-States
can be computed either within the tetrahedron method, or using a smearing
technique.
- 
1.A.4. The program admits many different types of pseudopotentials.
There are three complete sets of norm-conserving pseudopotentials available for
most elements of the periodic table. The recommended one (GGA) comes from the ONCVPSP generator (with spin-orbit coupling).
Another one is of the Troullier-Martins type (LDA and GGA),
one of the Goedecker type (LDA - this one include the spin-orbit coupling).
Three large sets of PAW atomic data can be read by ABINIT.
One is from GPAW (0.9), another one has been developed by ABINIT developers,
see http://arxiv.org/abs/1309.7274.
 a third one by  K.F. Garrity, J.W. Bennet, K.M. Rabe, D. Vanderbilt, arXiv:1305.5973v1
Both are in the PAW XML format. They have been tested against all-electron
calculations, see http://molmod.ugent.be/DeltaCodesDFT 
Most elements in the periodic table are present, except the rare earth elements and actinides.
Five codes are available to generate
new pseudopotentials when needed. Two of them are able to generate
pseudopotentials with a core hole, in order to compute core-level shifts.
(see the ABINIT web site).
Two of them are able to generate GGA pseudopotentials.
The ultra-soft pseudopotentials are approximations of the
Projector Augmented Wave approach, and are treated within this framework.
Norm-conserving pseudopotentials can be mixed, to generate "alchemical" pseudoatoms.
See also section 1.G.8, for the special case of the AtomPAW generator.
- 
1.A.5. Metallic as well as insulating systems can be treated. Schemes for
determination of the occupation number include the Fermi broadening,
the Gaussian broadening, the Gaussian-Hermite broadening, as well as
the modifications proposed by Marzari. Finite temperatures
can also be treated using a smearing scheme (Verstraete scheme).
- 
1.A.6. The cell may be orthogonal or non-orthogonal.
Any kind of symmetries and corresponding sets of k-point can be input,
and taken into account in the computation.
- 
1.A.7. The electronic system may be computed in the
spin-unpolarized or spin-polarized
case, with the possibility to impose occupation numbers of majority and
minority spins, and the spins of the starting configuration.
A specific option for efficient treatment of
anti-ferromagnetism (Shubnikov groups) is available.
The treatment of non-collinear magnetism is available.
The total magnetic moment of the unit cell can be constrained.
The local magnetization can also be constrained.
- 
1.A.8. The total energy, forces, stresses, and electronic structure
can be provided with the spin-orbit coupling included.
- 
1.A.9. A finite electric field can be imposed on insulators, using the proper
Hamiltonian for periodic systems based on Berry's phase. The generalisation
to fixed D-field or fixed reduced fields are also available,
as described in M. Stengel, N.A. Spaldin and D. Vanderbilt, Nat. Phys. 5,304 (2009).
- 
1.A.10. The decomposition of energy in its different component (local potential,
XC, hartree ...) is provided.
- 
1.A.11. Inner electronic eigenvalues can be computed thanks to the
minimisation of the residual with respect to a target energy value.
- 
1.A.12. The charge density of states close to the Fermi energy can be output,
and provide a simple approach to STM image calculations (not for PAW).
The angular momentum decomposed DOS can be evaluated in a sphere around any point,
and provide STS analysis.
- 
1.A.13. The lifetime of positrons and their annihilation rate can be computed,
using the self-consistent two-component DFT, including force and stress computation,
LDA and GGA, norm-conserving as well as PAW.
Doppler broadening can also be computed.
- 
1.A.14. The use of wavelets (instead of plane waves) is also being implemented.
At present, it works for the SCF cycles : one can get density, total energy
and forces, in some basic cases (HGH or GTH pseudopotentials only, or PAW - in v7.2).
Available also : the finite size corrections to the
total energy, restart on wavefunctions following the ETSF norm and geometry
relaxation using BFGS. Molecular dynamic is also available for
test purposes.
- 
1.A.15. The DFT+U approximation is also available (LDA+U, GGA+U), on the basis
of PAW calculations.
Available for non-spin-polarized, spin-polarized, and spinor cases (with spin-orbit also).
The U and J values can be determined using the method by Coccocioni and De Gironcoli,
as well as from a cRPA (constrained RPA) calculation.
- 
1.A.16. The local exchange (atomic exact exchange) approximation is also available, on the basis
of PAW calculations.
- 
1.A.17. Possibility to add a jellium slab in the cell.
- 
1.A.18. The kinetic energy density can be computed, as well as the electron localisation function
        (in the norm-conserving pseudopotential case). The ELF is not available in the non-collinear spin case.
- 
1.A.19. The Van der Waals energy can be added at postprocessing time, following the Silvestrelli method (and
variation thereof), as well as the vdW-DFT-D2 or D3 or D3(BJ)  (Grimme) semi-empirical potential.
- 
1.A.20. Dynamical mean-field calculations (non-self consistent and self-consistent) can be performed.
In order to solved the Anderson impurity model, the continuous-time Quantum Monte Carlo algorithm is used.
1.B. Derivatives of the total energy and eigenenergies (incl. linear and non-linear DFPT responses)
- 
1.B.1. Hellman-Feynman forces are computed from an analytical formula,
and corresponds exactly to the limit of finite differences of energy
for infinitesimally small atomic displacements when the ground-state
calculation is at convergence. This feature is available for all the
cases where the total energy can be computed.
A correction for non-converged
cases allows to get accurate forces with less converged wavefunctions
than without it. The decomposition of the forces in their different
components can be provided.
- 
1.B.2. Stress can also be computed. This feature is available for all
the cases where the total energy can be computed (except wavelets).
The decomposition of the stresses in their different
components can be provided. A smearing scheme applied to the kinetic
energy allows to get smooth energy curves as a function of
lattice parameters and angles.
- 
1.B.3. The polarization can be computed within the Berry phase
formulation. This feature is available for insulators,
magnetic or non-magnetic, and also when spin-orbit splitting
is present. The localisation tensor can also be computed.
Constrained-polarization calculations can be performed, following Na Sai
et al, PRB 66, 104108 (2002).
- 
1.B.4. For norm-conserving pseudopotentials and PAW, 
computation of the derivative with respect to the wavevector 
(d/dk - DDK perturbation) is available. This also includes the analytical
first derivative of the eigenvalues with respect to wavevector.
- 
1.B.5. For norm-conserving pseudopotentials, accurate responses
to atomic displacements and homogeneous electric fields
are available, and allows to compute the interatomic force constants,
the Born effective charges, the dielectric constant, the phonon band
structure. Symmetry characters of the phonons at Gamma
are computed. Thermodynamical properties, like the free energy,
the heat capacity or the entropy, can also be computed, in the
quasi-harmonic approximation.
The atomic temperature factors (Debye-Waller factors) are also available.
Phonon DOS and two-phonon DOS sum and difference spectra can be computed.
Available also for the native GGAs  of ABINIT (except the HCTH one),
including the Grimme Van der Waals fonctionals.
- 
1.B.6. For norm-conserving pseudopotentials, accurate responses
to strain perturbations are available,
and allow to compute, presently, the elastic constants
("rigid-ion" as well as "relaxed"), the piezoelectricity tensor,
as well as the so-called "internal strains", change of forces to atoms
due to a strain, or change of the stress due to an atomic displacement.
(no treatment of spin-orbit coupling yet).
Available also for the native GGAs of ABINIT (except the HCTH one),
including the Grimme Van der Waals fonctionals.
- 
1.B.7. For norm-conserving pseudopotentials, approximate or accurate susceptibility matrix
and dielectric matrix can be computed, thanks to a sum over states.
(no treatment of spin-polarized cases or spin-orbit coupling yet)
- 
1.B.8. For norm-conserving pseudopotentials and PAW, derivatives of the
electronic eigenenergies with respect
to the wavevector can be computed analytically.
- 
1.B.9. For norm-conserving pseudopotentials and PAW, optical conductivity
can be computed, thanks to the Kubo-Greenwood formula.
(no treatment of spin-orbit coupling yet)
- 
1.B.10. For norm-conserving pseudopotentials,
the band-by-band decomposition of the Born effective charges,
and the localisation tensor is available.
(no treatment of spin-polarized cases or spin-orbit coupling yet)
- 
1.B.11. For norm-conserving pseudopotentials, selected non-linear
response coefficients can be computed thanks to the
2n+1 theorem of perturbation theory, for electric field frequencies that are low compared to the gap. 
At present :
- the electro-optic coefficients 
   (no treatment of spin-polarized cases or spin-orbit coupling yet)
- the Raman cross sections
   (no treatment of spin-orbit coupling yet)
This is available for the non-spin-polarized case and the
spin-polarized case, but only in some flavors of LDA at present.
- 
1.B.12. For norm-conserving pseudopotentials,
electron-phonon matrix elements are available directly from the
output of a linear-response phonon calculation. They can be analyzed
to give e.g. the phonon linewidth or superconducting temperature.
(no treatment of spin-polarized cases or spin-orbit coupling yet),
or the thermal conductivity (phonon transport, limited by electron-phonon
collisions), the electrical conductivity (electron transport,
limited by electron-phonon collisions), the Seebeck coefficient (needed for thermoelectricity).
The latter is available aat different levels, including the inelastic variational approximation
- 
1.B.13. For norm-conserving pseudopotentials,
the computation of responses with respect to atomic displacement
is possible when a finite electric field is present.
This feature is available for insulators,
non-magnetic, no spin-orbit coupling.
Computation of other linear or non-linear responses within
an electric field is not yet possible.
- 
1.B.14. For norm-conserving pseudopotentials,
the computation of the second derivative of the eigenvalues 
with respect to atomic displacements
giving access to the temperature dependence of the band-gap (in the DFT)
is possible, in the Allen-Heine-Cardona approach (either adiabatic or including dynamical effects).
1.C. Excited states and other electronic properties.
- 
1.C.1. Computation of ionisation energies (N -> N-1 electrons)
and affinities (N -> N+1 electrons) in the GW approximation.
Four plasmon pole models are available, but that calculation
can also be done by a full integration over frequencies.
ABINIT is able to treat insulators as well as metals, 
also polymers or clusters.
In the GW implementation, it is possible to disable the full computation,
and actually do an Hartree-Fock, screened exchange or COHSEX calculation.
Self-consistency is implemented at a rather primitive level.
As vertex corrections, the bootstrap kernel and others can be included in the self-consistent W.
The Faleev method, see PRL 93, 126406 (2004), is implemented.
Available for the non-spin-polarized as well as spin-polarized case,
but not yet for spinor wavefunctions.
Spectral method for set-up of the frequency-dependent matrices
is available.
GW is available for norm-conserving as well as PAW methodology.
Not wavelets.
Convergence over the number of unoccupied band is much improved
with respect to usual implementations of GW, thanks to the
"extrapolar" method.
The Coulomb interaction can be cut-off in real space, in order
to treat efficiently in the supercell methodology : finite systems
and 1D-periodic systems (like polymers, nanowires or nanotubes)
Different treatments of the exchange for the core electrons
have been implemented : either the core contribution is approximated with vxc
or the core term is approximated with the Fock operator inside the PAW spheres.
Available in norm-conserving as well as in PAW.
A fast method is available, in case of finite systems (base on Lanczos-Sternehimer technique)..
- 
1.C.2. Excited states of atoms, molecules and solids.
For finite systems (atoms and molecules), excited states 
can be computed within TDDFT (Casida approach - only norm-conserving
pseudopotentials). For all systems, the Bethe-Salpeter Equation (norm-conserving as well as PAW) can be used. 
Oscillator strengths are available. In the non-spin-polarized case, spin-singlet as well
as spin-triplet excitations are computed. Spin-polarized
case is also available. In the Bethe-Salpeter case, direct diagonalization is available
to access directly eigenenergies of excitations..
The Tamm-Dancoff approximation can be performed, but the full treatment (pseudo-hermitian problem) is available as well.
- 
1.C.3. Frequency-dependent linear optical response is available,
for all components of the tensor (xx,yy,zz,xy,yz,zx),
real as well as imaginary part. This property can be computed at different levels of theory.  
It might be computed within the very simple
"sum-over-state" approximation (i.e. without local field corrections).
See the "optics" utility, for norm-conserving pseudopotentials and PAW.
Alternatively, it might be computed using the Bethe-Salpeter Equation (in the main ABINIT code), 
and an iterative (Haydock) technique.
The Tamm-Dancoff approximation can be performed, but the full treatment (pseudo-hermitian problem) is available as well.
For Bethe-Salpeter, the Rohlfing-Louie interpolation technique is available, as well as other
interpolation techniques..
- 
1.C.4. The electronic contribution to the frequency-dependent non-linear optical response is available,
for all components of the tensor (xxx,yyy,zzz,xyz ... ),
real as well as imaginary part. This property is computed within the very simple
"sum-over-state" approximation (i.e. without local field corrections).
See the "optics" utility, for norm-conserving pseudopotentials only.
Second-harmonic generation can be computed as well as electro-optic effect (not including the ion motion contribution
in this case).
- 
1.C.5. Frequency-dependent electronic conductivity,
dielectric tensor, index of refraction, reflectivity, absorption,
the thermal conductivity, and the thermopower
(electron transport, high temperature, Kubo-Greenwood formalism)
real as well as imaginary part. This property is computed within the very simple
"sum-over-state" approximation (i.e. without local field corrections).
See the "conducti" utility, for norm-conserving pseudopotentials,
as well as PAW.
- 
1.C.6. Computation of the random stopping power, in the RPA and in the TD-LDA.
- 
1.C.7. The DFT effective mass tensor can be computed directly using DFPT, including the case of degeneracies..
1.D. Displacement of atoms, and changes of cell parameters.
- 
1.D.1. Different algorithms (Broyden; modified Broyden; Verlet with sudden
stop of atoms) allows to find the equilibrium configuration
of the nuclei, for which the forces vanish. The cell parameters
can also be optimized concurently with the atomic positions.
Specified lattice parameters, or angles, or atomic positions,
can be kept fixed if needed.
- 
1.D.2. Three molecular dynamics algorithm (Numerov, Verlet, Blanes and Moanes)
allow to perform simulations in real
(simulated) time. The displacement of atoms may be computed according to
Newton's law, or by adding a friction force to it.
Nose-Hoover thermostat is available with Verlet algorithm.
Langevin dynamics is also available.
- 
1.D.3. The string method is available, to compute transition paths between
two configurations/local minima of the Born-Oppenheimer hypersurface (typically reactants and products).
The Nudged Elastic Band method, with the same goal, is also available.
A genetic algorithm to find global optima is also available.
These make use of the "image" concept (a population of cells is treated). 
- 
1.D.4. The Path-Integral Molecular Dynamics in NVT ensemble is available
with two thermostats (Langevin and Nose-Hoover chains) and possible coordinate transformations (primitive, staging, normal modes).
- 
1.D.5. The code can provide an automatic analysis of bond lengths and angles,
and the atomic coordinates in a format suitable for vizualisation with XMOL.
- 
1.D.6. Response function from Molecular Dynamics.
Phonon spectrum obtained from trajectories (with a minimal set of coefficients)
Included in APPA.
1.E. Properties at the nuclei sites.
- 
1.E.1. Electric field gradients at nuclei sites are available.
(Only in the Projector Augmented Waves formalism).
- 
1.E.2. The Fermi contact term is also available (needed
to compute isomer shifts in Mossbauer spectroscopy)
(Only in the Projector Augmented Waves formalism).
- 
1.E.3. Magnetic nuclear dipoles can be introduced, to allow computation of the Nuclear Magnetic Shielding tensor
    through the converse method (change of magnetization due to magnetic nuclear dipole).
1.F. Analysis and graphical tools.
- 
1.F.1. A post-processor, called "cut3d", is available to analyse density,
potential and wavefunction files. It is able to change the format of these
files, to extract the data on a 2D plane, or along a 1D line.
It can perform the Hirshfeld computation of atomic charges.
It can analyse the charge contained in an atomic sphere,
and determine the angular momentum projected charge (l=0 to 4)
contained in that sphere.
(only available for norm-conserving pseudopotentials)
- 
1.F.2. Another post-processor, called "aim", is available to
perform the Bader "Atom-In-Molecule" analysis of the density.
(only available for norm-conserving pseudopotentials)
- 
1.F.3. A special part in the tutorial (see later) indicates how
to generate properly formatted data for the visualisation of :
 - the band structure (visualisation thanks to XMGR)
- total energies vs different parameters (also using XMGR)
- the charge density (3D isosurfaces)
 (the cut3d postprocessor must be used, followed by matlab)
The cut3d postprocessor also allows to prepare 2D charge density plots.
- 
1.F.4. The post-processor band2eps allows to draw phonon dispersion
curves automatically, in a file written in Encapsulated PostScript (eps).
Moreover, a color code allows to emphasize the contribution of individual
atoms to the corresponding eigenvector (at most three types of atoms).
- 
1.F.5. Utilities to view the structures and make the band structure
are available (StructureViewer, BandStructureMaker).
- 
1.F.6. The Fold2Bloch utility is available, to extract, from supercell computations (that include e.g.
    defects), the components (and weight) of wavefunctions in the underlying small primitive cell.
1.G. Interfacing with other codes.
- 
1.G.1. ABINIT is interfaced with WanT (see http://www.wannier-transport.org).
WanT, as a postprocessor of ABINIT results, allows to compute
maximally localized Wannier functions, as well as transport
properties through nanostructures.
- 
1.G.2. ABINIT is interfaced with CASINO, a code for Quantum Monte Carlo
calculation of the properties of solids and molecules.
- 
1.G.3. XML pseudopotentials can be used, these being common with the SIESTA
code.
- 
1.G.4. The _WFK or _KSS functions can be used as starting basis
by the TDDFT, GW and/or Bethe-Salpether softwares EXC, SELF and DP
code, see http://www.abinit.org/community/?text=links .
- 
1.G.5. ABINIT can produce and read files following the Nanoquanta/ETSF
file format (http://www.etsf.eu/fileformats)
- 
1.G.6. ABINIT is interfaced with Wannier90 (see http://www.wannier90.org).
Wannier90, as a postprocessor of ABINIT results, allows to compute
maximally localized Wannier functions, as well as transport
properties through nanostructures.
- 
1.G.7. ABINIT is interfaced with V_Sim , a structure and density visualisation tool.
(http://www-drfmc.cea.fr/sp2m/L_Sim/V_Sim/index.en.html)
- 
1.G.8. The PAW atomic data generator is a plugin of ABINIT. The integration 
is such that there are ABINIT automatic tests in which a PAW atomic data is generated, and then
used in ABINIT.
1.H. Verification (cross checking with other codes).
1.H.1. The ground state total energy, geometry relaxation, phonon frequencies, electron-phonon matrix elements,
temperature dependence of the electronic gap has been cross checked with Quantum Espresso and YAMBO. 
See S. Poncé and G. Antonius and P. Boulanger and E. Cannuccia and A. Marini and M. Côté and X. Gonze,
Verification of first-principles codes: Comparison of total energies, phonon frequencies, electron--phonon coupling and zero-point motion correction to the gap between ABINIT and QE/Yambo.
Computational Materials Science 83, 341 (2014).
1.H.2. It is possible to choose the same convention for the definition of the average eletrostatic potential, than Quantum Espresso,
allowing verification of results for charged systems. 
See F. Bruneval, J.-P. Crocombette, X. Gonze, B. Dorado, M. Torrent and F. Jollet
Consistent treatment of charged systems within periodic boundary conditions : the projector augmented-wave and pseudopotential methods revisited
Phys. Rev. B (accepted 23 December 2013)
- 
 
 2. Speed and memory.2.A. Speed in the sequential version.
- 
2.A.1. Depending on the number of atoms, there are two regimes in the
code : at low number of atoms and electrons,
the CPU time is dominated by Fast Fourier
Transforms with an average scaling O(N^2 log N) where N is some
number characteristics of the size of the system (atoms, electrons);
at large number of atoms and electrons, the CPU time is dominated
by non-local operator aplication and orthogonalisation, with
an average scaling O(N^3).
 
- 
2.A.2. The complex-to-complex Fast Fourier Transform routine for application
of the Hamiltonian has been highly optimized, and take into account
the fact that the wavefunction do not fill the reciprocal space FFT box.
Library FFTs are also available, but they are found to be slower than the
present FFT routine, developed starting from a routine
provided by S. Goedecker.
A real-to-complex FFT is used for treating potential and densities
of the ground state, since they are real.
For selected k-points, invariant under time-reversal symmetry,
( (0 0 0), (1/2 0 0), (0 1/2 0), (0 0 1/2), (1/2 1/2 0) ... (1/2 1/2 1/2) ),
the number of planewave explicitly treated is divided by two. A
real-to-complex FFT is used then.
 
- 
2.A.3. The non-local potential is applied in reciprocal space. It has been
optimized carefully, although there is still some speed-up to be
coded when the k-point is invariant under time-reversal.
The orthogonalisation procedure can be done twice per loop or only once.
 
- 
2.A.4. At the level of the generation of electronic eigenfunctions, an efficient
band-by-band preconditioned conjugate-gradient algorithm is used,
in its non-self-consistent version. An alternative algorithm "lobpcg"
is also available.
 
- 
2.A.5. At the level of the self-consistency loop, an efficient
potential-based preconditioned conjugate-gradient algorithm is used.
Simple mixing is also available, as well as the Anderson algorithm.
Preconditioning of this algorithm is achieved through a model
dielectric function, or through an approximate dielectric matrix.
 
 2.B. Speed in the parallel version.
- 
2.B.1. For ground-state calculations, MPI-based parallelism.
The code has been parallelized
on the k-points, on the spins, on the spinor components, on the bands,
and on the FFT grid and plane wave coefficients.
For the k-point and spin parallelisations (using MPI), the communication
load is generally very small. This allows it to be used on a cluster of workstations.
However, the number of nodes that can be used in parallel might
be small, and depends strongly on the physics of the problem.
A combined FFT / band parallelisation is available, and has shown
very large speed up (>1000) on powerful computers with a large number of processors
and high-speed interconnect. The combination of FFT / band / k point and spin
parallelism is also available, and quite efficient for such computers.
Available for norm-conserving as well as PAW cases.
Automatic determination of the best combination of parallelism levels is available.
Use of MPIIO is mandatory for the largest speed ups to be observed.
- 
2.B.2. For ground-state calculations, with a set of images (e.g. nudged elastic band method,
the string method, the path-integral molecular dynamics, the genetic algorithm), MPI-based parallelism.
The work load for the different images has been distributed, and this marallelism can be 
combined with the parallelism described in point 2.B.1, leading to speed-up beyond 5000.
- 
2.B.3. For ground-state calculations, GPU can be used. This is based on CUDA+MAGMA.
 
- 
2.B.4. For ground-state calculations, the wavelet part of ABINIT (BigDFT) is also very well parallelized :
MPI band parallelism, combined with GPUs.
- 
2.B.5. For response calculations, MPI-based parallelism.
The code has been parallelized
on k-points, spins, bands, as well as on perturbations.
For the k-points, spins and bands parallelisation,
the communication load is rather
small also, and, unlike for the GS calculations, the number
of nodes that can be used in parallel will be large,
nearly independently of the physics of the problem.
Parallelism on perturbations is very similar to the parallelism on images in the ground state case (so, very efficient),
although the load balancing problem for perturbations with different number of k points is not adressed at present.
Use of MPIIO is mandatory for the largest speed ups to be observed.
 
- 
2.B.6. GW calculations are MPI-parallelized over k-points.
They are also parallelized over transitions (valence to conduction band pairs), but the two parallelisation
cannot be used currently at present.
The transition parallelism has been show to allow speed ups as large as 300.
 
- 
2.B.7. Ground state, response function, and GW parallel calculations 
can be done also by using OpenMP parallelism, even combined with MPI parallelism. 
 
- 
2.C.1. The requirements of the different conjugate gradient algorithms
on memory are relatively low, especially when the number of atoms
is large. Optionally, it is even possible to use disk space
to save memory, at the expense of computing time.
In particular, when the number of k points is large, they can
be stored in memory one at a time. Phase factors in the application of
the non-local operator can also be recomputed at each application,
in order to save memory. For k-points that are invariant under
time-reversal symmetry, the storage required for wavefunctions
is half the storage for other k-point.
 
 
 
 3. The user's point of view.3.A. The Web site.
- 
3.A.1. A Web site can be accessed. The complete sources (and all the tests)
of the ABINIT package are available there.
Executables for many different platforms are also available,
in specific packages that also include
the 'Infos' directory are also available.
Installation notes, current features of ABINIT,
the tutorial, on-line help can be vizualized directly from the web.
 
- 
3.A.2. Also available from the Web site :
 
 -  the pseudopotentials 
-  some utilities (including cut3d, a density analyser) 
-  five mailing lists
(one for "official announcements", one for the developpers, one for the developers using the GNU Arch repository, one "forum mailing list" , one for the advisory committee) 
-  the ABINIT bibliography database, that contains references of papers
in which ABINIT or one of its predecessors have been used 
-  the FAQ (frequently asked questions) database 
-  ... (many more infos)
 
 
 3.B. Portability.
- 
3.B.1. The ABINIT package v6 is routinely installed (sequential and parallel) on different platforms :
 -  PC/Linux based on Intel Xeon processors or AMD processors
 with GFort compiler, g95 compiler, Intel compiler, pathscale compiler.
-  PC under Windows 
-  ItaniumII 
-  IBM RS6000 based on Power processors 
-  Mac OS X 
 
 
- 
3.B.2. Installation is made thanks to the autotools (autoconf and automake)
as well as different scripts. All machine-dependent parameters
are grouped in one single short file for each machine.
The parallel and sequential
version of the code, as well as the different versions for the different
machines, are prepared on-the-fly,
so that there is only one unique source code.
 
- 
3.B.3. Binaries for different machines can be managed in the
same main directory, as they might be placed automatically in different
sub-directories.
 
 3.C. Running jobs, input and output files.
- 
3.C.1. The input variables are gathered in one unique file,
read by a text processing facility build in the code.
Many defaults values are provided, so that
the input file can be kept rather short.
 
- 
3.C.2. Many different stopping criteria allow the user to target the accuracy
he or she wants to obtain.
 
- 
3.C.3. The outputs are provided to one main file and one auxiliary file,
as well as different specialized files (for density, potential,
wavefunctions, ...) .
The main file is shorter than the auxiliary file, and well formatted,
while all important results are gathered
there. It can be used for archival purposes. The auxiliary (log) file
will contain all exception messages.
 
- 
3.C.4. Exception handling is provided through four different types
of messages : COMMENT, WARNING, BUG and ERROR. In each case, the
accurate meaning of the exception is described, as well as the
eventual action to be taken by the user.
 
- 
3.C.5. Statistics of foreseen memory and disk usage is printed at
the beginning of the run. Statistics of CPU time usage
is printed at the end of the run.
 
- 
3.C.6. There is a facility to stop the run in a clean way at any time.
The user may specify a cpu time limit, after which the job must end
smoothly.
 
- 
3.C.7. A status file, updated very frequently, gives an on-the-fly
report of progress of the current run.
 
- 
3.C.8. The code can handle multiple datasets contained in the input
file, where generic input variables valid for all datasets
can be defined. These calculations for different dataset
can be chained, so that in one run, many complex tasks can be accomplished.
This allows easy convergence studies.
 
- 
3.C.9. The code can start the current run from a wavefunction input file
generated in a previous run. This allows to cut down the number
of iteration to self-consistency, or to perform other tasks.
This can be done even if the previous job
had different computational parameters, like different k points,
different energy cut-off, spin-unpolarized or spin-polarized wavefunctions,
scalr or spinor wavefunctions ... Of course, the restart from
wildly different parameters will not save a lot of CPU time.
 
- 
3.C.10. A GUI is available, see the directory gui, also containing instructions to run it.
 
 3.D. Documentation.
- 
3.D.1. A new_user_guide and a few rather detailed help files
(abinit_help, respfn_help, anaddb_help, mrgddb_help, newsp_help,
aim_help ... ) are available on-line.
 
- 
3.D.2. A tutorial with more than 30 lessons is available on-line.
It starts with the computation of
different properties of the H2 molecule, describes convergence studies,
then focuses on bulk Silicon, bulk Al and Al surface. It describes
the use of response-function features, the GW calculation of band structures,
the PAW  methodology, generation and validation of PAW atomic data, 
computation of spin-dependent properties,
DFT+U (incl. determination of U),
analysis tools, the phonon band structure, linear and non-linear optics,
electro-optic effects, electron-phonon coupling, phonon linewidths due
to electron-phonon interaction, Raman scattering efficiencies, piezoelectricity,
the computation of elastic constants, parallelism, the source code, etc ...
A set of seven lessons specifically deals with the use of the parallelism.
 
- 
3.D.3. Many test cases are provided, and can help the user in setting up a run.
 
- 
3.D.4. An ABINIT bibliography database, that contains references of papers
in which ABINIT or one of its predecessors is available on the Web site.
 
 3.E. Generation of the k-points, geometries, and starting wavefunctions.
- 
3.E.1. The code can automatically generate symmetries from the primitive cell
and the position of atoms. In this case, it identifies
automatically the Bravais lattice, point group and space group.
Alternatively, it can start from the
symmetries and generate the atomic positions from the irreducible
set. Also, a database of the 230 spatial groups of symmetry
is built inside ABINIT.
 
- 
3.E.2. Anti-ferromagnetic symmetry operations, and associated
magnetic space groups (Shubnikov groups) can be treated with the
same facility as usual space groups, including a database of the
1191 Shubnikov groups.
 
- 
3.E.3. A geometry builder is available inside the code.
It can take one (or two) group
of atoms, rotate it, translate it and repeat it, then create vacancies.
 
- 
3.E.4. The generation of special k point sets (Monkhorst-Pack sets)
and band structure k points can also be done directly inside ABINIT.
A list of interesting k point sets, can be generated automatically,
including a measure of their accuracy in term of integration
within the Brillouin Zone.
 
- 
3.E.5. A utility for generating wavefunctions with new characteristic
(cut-off, k-point) from already existing wavefunctions with
different characteristics is available (newsp).
 
 3.F. Automatic determination of input parameters.
- 
3.F.1. Many defaults values are provided.
 
- 
3.F.2. The FFT grid parameters can be automatically generated
from the cut-off energy and geometry of the system.
 
- 
3.F.3. The number of bands and starting occupation numbers can be automatically
generated from the input set of atoms.
 
- 
3.F.4. There is a database of atomic masses.
 
 
 
 4. The programmer's point of view.
- 
4.A. The code is distributed without charge under the
GNU General Public Licence (GPL). This
garantees that the future modifications of the code
stay available to the developpers and users for free.
This Copyright is often referred to as a "Copyleft".
 
- 
4.B. The code is written in clean Fortran90. Strict programming rules
have been followed. These are documented. Comments are numerous, and
all in english.
 
- 
4.C. Quick, fully automatic, testing of the code is available in the makefile,
giving diagnostics on the validity of computed energy, forces, stresses
and eigenvalues for five typical cases.
 
- 
4.D. More extensive testing is provided in five batteries of tests
(altogether more than 600 different runs), with automatic comparison
with results of preceding versions. A specialized diff script (called
'fldiff') has been written in order to ease the diagnostic on the
suite of tests. A short one-line-per-test file is produced (report file).
In addition to tests of the correctness of the execution,
for the sequential or parallel version of the main code, as well as some
utilities, there are automatic diagnostics of the speed of crucial
routines, and the response to a load of up to 4 instances of the
main code, running concurrently.
 
- 
4.E. Debugging facilities are provided inside the code, and can be directly
accessed from the input file. The compilation can be done in both
debugging or normal mode : the C-preprocessed files are either kept
or removed automatically.
 
- 
4.F. "Design-by-contract" utility routines are present, as well as
a flag to preprocess the sources, and eliminate the design-by-contract
features in production mode.