Probabilistic algorithms

In Press
Convergence Analysis of the Stochastic Resolution of Identity: Comparing Hutchinson to Hutch++ for the Second-Order Green's Function
Mejía, L. ; Sharma, S. ; Baer, R. ; Chan, G. K. - L. ; Rabani, E. Convergence Analysis of the Stochastic Resolution of Identity: Comparing Hutchinson to Hutch++ for the Second-Order Green's Function. Journal of Chemical Physics In Press. Publisher's VersionAbstract

Stochastic orbital techniques offer reduced computational scaling and memory requirements to describe ground and excited states at the cost of introducing controlled statistical errors. Such techniques often rely on two basic operations, stochastic trace estimation and stochastic resolution of identity, both of which lead to statistical errors that scale with the number of stochastic realizations (\$N\_\\textbackslashxi\\$) as \$\textbackslashsqrt\N\_\\textbackslashxi\ˆ\-1\\\$. Reducing the statistical errors without significantly increasing \$N\_\\textbackslashxi\\$ has been challenging and is central to the development of efficient and accurate stochastic algorithms. In this work, we build upon recent progress made to improve stochastic trace estimation based on the ubiquitous Hutchinson's algorithm and propose a two-step approach for the stochastic resolution of identity, in the spirit of the Hutch++ method. Our approach is based on employing a randomized low-rank approximation followed by a residual calculation, resulting in statistical errors that scale much better than \$\textbackslashsqrt\N\_\\textbackslashxi\ˆ\-1\\\$. We implement the approach within the second-order Born approximation for the self-energy in the computation of neutral excitations and discuss three different low-rank approximations for the two-body Coulomb integrals. Tests on a series of hydrogen dimer chains with varying lengths demonstrate that the Hutch++-like approximations are computationally more efficient than both deterministic and purely stochastic (Hutchinson) approaches for low error thresholds and intermediate system sizes. Notably, for arbitrarily large systems, the Hutchinson-like approximation outperforms both deterministic and Hutch++-like methods.

2024
Stochastic density functional theory combined with Langevin dynamics for warm dense matter
Hadad, E. R. ; Roy, A. ; Rabani, E. ; Redmer, R. ; Baer, R. Stochastic density functional theory combined with Langevin dynamics for warm dense matter. Physical Review E 2024, 109, 065304. Publisher's VersionAbstract

This study overviews and extends a recently developed stochastic finite-temperature Kohn-Sham density functional theory to study warm dense matter using Langevin dynamics, specifically under periodic boundary conditions. The method's algorithmic complexity exhibits nearly linear scaling with system size and is inversely proportional to the temperature. Additionally, a novel linear-scaling stochastic approach is introduced to assess the Kubo-Greenwood conductivity, demonstrating exceptional stability for DC conductivity. Utilizing the developed tools, we investigate the equation of state, radial distribution, and electronic conductivity of Hydrogen at a temperature of 30,000K. As for the radial distribution functions, we reveal a transition of Hydrogen from gas-like to liquid-like behavior as its density exceeds 4 g/cm³. As for the electronic conductivity as a function of the density, we identified a remarkable isosbestic point at frequencies around 7eV, which may be an additional signature of a gas-liquid transition in Hydrogen at 30,000K.

Weak second-order quantum state diffusion unraveling of the Lindblad master equation
Baer, R. ; Adhikari, S. Weak second-order quantum state diffusion unraveling of the Lindblad master equation. Journal of Chemical Physics 2024, 160, 064107. Publisher's VersionAbstract

Abstract Simulating mixed-state evolution in open quantum systems is crucial for various chemical physics, quantum optics, and computer science applications. These simulations typically follow the Lindblad master equation dynamics. An alternative approach known as quantum state diffusion unraveling is based on the trajectories of pure states generated by random wave functions, which evolve according to a nonlinear Itô-Schrödinger equation (ISE). This study introduces weak first- and second-order solvers for the ISE based on directly applying the Itô-Taylor expansion with exact derivatives in the interaction picture. We tested the method on free and driven Morse oscillators coupled to a thermal environment and found that both orders allowed practical estimation with a few dozen iterations. The variance was relatively small compared to the linear unraveling and did not grow with time. The second-order solver delivers much higher accuracy and stability with bigger time steps than the first-order scheme, with a small additional workload. However, the second-order algorithm has quadratic complexity with the number of Lindblad operators as opposed to the linear complexity of the first-order algorithm.

AdhikariWeak2024.pdf
2022
Linear Weak Scalability of Density Functional Theory Calculations without Imposing Electron Localization
Fabian, M. D. ; Shpiro, B. ; Baer, R. Linear Weak Scalability of Density Functional Theory Calculations without Imposing Electron Localization. J. Chem. Theory Comput. 2022, acs.jctc.1c00829. Publisher's VersionAbstract

Linear scaling density functional theory (DFT) approaches to the electronic structure of materials are often based on the tendency of electrons to localize in large atomic and molecular systems. However, in many cases of actual interest, such as semiconductor nanocrystals, system sizes can reach a substantial extension before significant electron localization sets in, causing a considerable deviation from linear scaling. Herein, we address this class of systems by developing a massively parallel DFT approach which does not rely on electron localization and is formally quadratic scaling yet enables highly efficient linear wall-time complexity in the weak scalability regime. The method extends from the stochastic DFT approach described in Fabian et al. (WIRES: Comp. Mol. Sci. 2019, e1412) but is entirely deterministic. It uses standard quantum chemical atomcentered Gaussian basis sets to represent the electronic wave functions combined with Cartesian real-space grids for some operators and enables a fast solver for the Poisson equation. Our main conclusion is that when a processor-abundant high-performance computing (HPC) infrastructure is available, this type of approach has the potential to allow the study of large systems in regimes where quantum confinement or electron delocalization prevents linear scaling.

Stochastic Vector Techniques in Ground-State Electronic Structure
Baer, R. ; Neuhauser, D. ; Rabani, E. Stochastic Vector Techniques in Ground-State Electronic Structure. Annu. Rev. Phys. Chem. 2022, 73, annurev–physchem–090519–045916. Publisher's VersionAbstract

We review a suite of stochastic vector computational approaches for studying the electronic structure of extended condensed matter systems. These techniques help reduce algorithmic complexity, facilitate efficient parallelization, simplify computational tasks, accelerate calculations, and diminish memory requirements. While their scope is vast, we limit our study to ground-state and finite temperature density functional theory (DFT) and second-order perturbation theory. More advanced topics, such as quasiparticle (charge) and optical (neutral) excitations and higher-order processes, are covered elsewhere. We start by explaining how to use stochastic vectors in computations, characterizing the associated statistical errors. Next, we show how to estimate the electron density in DFT and discuss highly effective techniques to reduce statistical errors. Finally, we review the use of stochastic vector techniques for calculating correlation energies within the secondorder Møller-Plesset perturbation theory and its finite temperature variational form. Example calculation results are presented and used to demonstrate the efficacy of the methods.

Forces from Stochastic Density Functional Theory under Nonorthogonal Atom-Centered Basis Sets
Shpiro, B. ; Fabian, M. D. ; Rabani, E. ; Baer, R. Forces from Stochastic Density Functional Theory under Nonorthogonal Atom-Centered Basis Sets. J. Chem. Theory Comput. 2022, 18, 1458–1466. Publisher's VersionAbstract

We develop a formalism for calculating forces on the nuclei within the linear-scaling stochastic density functional theory (sDFT) in a nonorthogonal atomcentered basis set representation (Fabian et al. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2019, 9, e1412, 10.1002/wcms.1412) and apply it to the Tryptophan Zipper 2 (Trpzip2) peptide solvated in water. We use an embedded-fragment approach to reduce the statistical errors (fluctuation and systematic bias), where the entire peptide is the main fragment and the remaining 425 water molecules are grouped into small fragments. We analyze the magnitude of the statistical errors in the forces and find that the systematic bias is of the order of 0.065 eV/Å (∼1.2 × 10−3Eh/a0) when 120 stochastic orbitals are used, independently of system size. This magnitude of bias is sufficiently small to ensure that the bond lengths estimated by stochastic DFT (within a Langevin molecular dynamics simulation) will deviate by less than 1% from those predicted by a deterministic calculation.

2021
Tempering stochastic density functional theory
Nguyen, M. ; Li, W. ; Li, B. (Y. ); Baer, R. ; Rabani, E. ; Neuhauser, D. Tempering stochastic density functional theory. J. Chem. Phys. 2021, 5.0063266. Publisher's VersionAbstract

We introduce a tempering approach with stochastic density functional theory (sDFT), labeled t-sDFT, which reduces the statistical errors in the estimates of observable expectation values. This is achieved by rewriting the electronic density as a sum of a "warm" component complemented by "colder" correction(s). Since the "warm" component is larger in magnitude but faster to evaluate, we use many more stochastic orbitals for its evaluation than for the smaller-sized colder correction(s). This results in a significant reduction of the statistical fluctuations and the bias compared to sDFT for the same computational effort. We the method's performance on large hydrogen-passivated silicon nanocrystals (NCs), finding a reduction in the systematic error in the energy by more than an order of magnitude, while the systematic errors in the forces are also quenched. Similarly, the statistical fluctuations are reduced by factors of around 4-5 for the total energy and around 1.5-2 for the forces on the atoms. Since the embedding in t-sDFT is fully stochastic, it is possible to combine t-sDFT with other variants of sDFT such as energy-window sDFT and embedded-fragmented sDFT.

nguyen2021temperin.pdf
Stochastic density functional theory: Real- and energy-space fragmentation for noise reduction
Chen, M. ; Baer, R. ; Neuhauser, D. ; Rabani, E. Stochastic density functional theory: Real- and energy-space fragmentation for noise reduction. J. Chem. Phys. 2021, 154, 204108. Publisher's VersionAbstract

Stochastic density functional theory (sDFT) is becoming a valuable tool for studying ground-state properties of extended materials. The computational complexity of describing the Kohn–Sham orbitals is replaced by introducing a set of random (stochastic) orbitals leading to linear and often sub-linear scaling of certain ground-state observables at the account of introducing a statistical error. Schemes to reduce the noise are essential, for example, for determining the structure using the forces obtained from sDFT. Recently, we have introduced two embedding schemes to mitigate the statistical fluctuations in the electron density and resultant forces on the nuclei. Both techniques were based on fragmenting the system either in real space or slicing the occupied space into energy windows, allowing for a significant reduction in the statistical fluctuations. For chemical accuracy, further reduction of the noise is required, which could√be achieved by increasing the number of stochastic orbitals. However, the convergence is relatively slow as the statistical error scales as 1/ Nχ according to the central limit theorem, where Nχ is the number of random orbitals. In this paper, we combined the embedding schemes mentioned above and introduced a new approach that builds on overlapped fragments and energy windows. The new approach significantly lowers the noise for ground-state properties, such as the electron density, total energy, and forces on the nuclei, as demonstrated for a G-center in bulk silicon.

chen2021stochastic.pdf
2020
Dopant levels in large nanocrystals using stochastic optimally tuned range-separated hybrid density functional theory
Lee, A. J. ; Chen, M. ; Li, W. ; Neuhauser, D. ; Baer, R. ; Rabani, E. Dopant levels in large nanocrystals using stochastic optimally tuned range-separated hybrid density functional theory. Physical Review B 2020, 102, 035112. Publisher's Version lee2020dopant.pdf
Linear-Response Time-Dependent Density Functional Theory with Stochastic Range-Separated Hybrids
Zhang, X. ; Lu, G. ; Baer, R. ; Rabani, E. ; Neuhauser, D. Linear-Response Time-Dependent Density Functional Theory with Stochastic Range-Separated Hybrids. Journal of Chemical Theory and Computation 2020, 16, 1064–1072. Publisher's VersionAbstract

Generalized Kohn−Sham density functional theory is a popular computational tool for the ground state of extended systems, particularly within range-separated hybrid (RSH) functionals that capture the long-range electronic interaction. Unfortunately, the heavy computational cost of the nonlocal exchange operator in RSH-DFT usually confines the approach to systems with at most a few hundred electrons. A significant reduction in the computational cost is achieved by representing the density matrix with stochastic orbitals and a stochastic decomposition of the Coulomb convolution (J. Phys. Chem. A 2016, 120, 3071). Here, we extend the stochastic RSH approach to excited states within the framework of linear-response generalized Kohn−Sham time-dependent density functional theory (GKS-TDDFT) based on the plane-wave basis. As a validation of the stochastic GKS-TDDFT method, the excitation energies of small molecules N2 and CO are calculated and compared to the deterministic results. The computational efficiency of the stochastic method is demonstrated with a two-dimensional MoS2 sheet (∼1500 electrons), whose excitation energy, exciton charge density, and (excited state) geometric relaxation are determined in the absence and presence of a point defect.

zhang2020linear.pdf
Range-separated stochastic resolution of identity: Formulation and application to second-order Green’s function theory
Dou, W. ; Chen, M. ; Takeshita, T. Y. ; Baer, R. ; Neuhauser, D. ; Rabani, E. Range-separated stochastic resolution of identity: Formulation and application to second-order Green’s function theory. The Journal of Chemical Physics 2020, 153, 074113. Publisher's VersionAbstract

We develop a range-separated stochastic resolution of identity (RS-SRI) approach for the four-index electron repulsion integrals, where the larger terms (above a predefined threshold) are treated using a deterministic RI and the remaining terms are treated using a SRI. The approach is implemented within a second-order Green’s function formalism with an improved O(N3) scaling with the size of the basis set, N. Moreover, the RS approach greatly reduces the statistical error compared to the full stochastic version [T. Y. Takeshita et al., J. Chem. Phys. 151, 044114 (2019)], resulting in computational speedups of ground and excited state energies of nearly two orders of magnitude, as demonstrated for hydrogen dimer chains and water clusters.

dou2020range.pdf
Efficient Langevin dynamics for "noisy" forces
Arnon, E. ; Rabani, E. ; Neuhauser, D. ; Baer, R. Efficient Langevin dynamics for "noisy" forces. J. Chem. Phys. 2020, 152, 161103. Publisher's VersionAbstract

Efficient Boltzmann-sampling using first-principles methods is challenging for extended systems due to the steep scaling of electronic structure methods with the system size. Stochastic approaches provide a gentler system-size dependency at the cost of introducing "noisy" forces, which serve to limit the efficiency of the sampling. In the first-order Langevin dynamics (FOLD), efficient sampling is achievable by combining a well-chosen preconditioning matrix S with a time-step-bias-mitigating propagator (Mazzola et al., Phys. Rev. Lett., 118, 015703 (2017)). However, when forces are noisy, S is set equal to the force-covariance matrix, a procedure which severely limits the efficiency and the stability of the sampling. Here, we develop a new, general, optimal, and stable sampling approach for FOLD under noisy forces. We apply it for silicon nanocrystals treated with stochastic density functional theory and show efficiency improvements by an order-of-magnitude.

arnon2020efficient.pdf
2019
Energy window stochastic density functional theory
Chen, M. ; Baer, R. ; Neuhauser, D. ; Rabani, E. Energy window stochastic density functional theory. The Journal of Chemical Physics 2019, 151, 114116. Publisher's Version chen2019energy.pdf
Stochastic Resolution of Identity for Real-Time Second-Order Green’s Function: Ionization Potential and Quasi-Particle Spectrum
Dou, W. ; Takeshita, T. Y. ; Chen, M. ; Baer, R. ; Neuhauser, D. ; Rabani, E. Stochastic Resolution of Identity for Real-Time Second-Order Green’s Function: Ionization Potential and Quasi-Particle Spectrum. Journal of Chemical Theory and Computation 2019. Publisher's VersionAbstract

We develop a stochastic resolution of identity approach to the real-time second-order Green’s function (real-time sRI-GF2) theory, extending our recent work for imaginary-time Matsubara Green’s function [Takeshita et al. J. Chem. Phys. 2019, 151, 044114]. The approach provides a framework to obtain the quasi-particle spectra across a wide range of frequencies and predicts ionization potentials and electron affinities. To assess the accuracy of the real-time sRI-GF2, we study a series of molecules and compare our results to experiments as well as to a many-body perturbation approach based on the GW approximation, where we find that the real-time sRI-GF2 is as accurate as self-consistent GW. The stochastic formulation reduces the formal computatinal scaling from O(Ne5) down to O(Ne3) where Ne is the number of electrons. This is illustrated for a chain of hydrogen dimers, where we observe a slightly lower than cubic scaling for systems containing up to Ne ≈ 1000 electrons.

dou2019stochastic.pdf
Transition to metallization in warm dense helium-hydrogen mixtures using stochastic density functional theory within the Kubo-Greenwood formalism
Cytter, Y. ; Rabani, E. ; Neuhauser, D. ; Preising, M. ; Redmer, R. ; Baer, R. Transition to metallization in warm dense helium-hydrogen mixtures using stochastic density functional theory within the Kubo-Greenwood formalism. Physical Review B 2019, 100. Publisher's VersionAbstract

Abstract The Kubo-Greenwood (KG) formula is often used in conjunction with Kohn-Sham (KS) density functional theory (DFT) to compute the optical conductivity, particularly for warm dense mater. For applying the KG formula, all KS eigenstates and eigenvalues up to an energy cutoff are required and thus the approach becomes expensive, especially for high temperatures and large systems, scaling cubically with both system size and temperature. Here, we develop an approach to calculate the KS conductivity within the stochastic DFT (sDFT) framework, which requires knowledge only of the KS Hamiltonian but not its eigenstates and values. We show that the computational effort associated with the method scales linearly with system size and reduces in proportion to the temperature unlike the cubic increase with traditional deterministic approaches. In addition, we find that the method allows an accurate description of the entire spectrum, including the high-frequency range, unlike the deterministic method which is compelled to introduce a high-frequency cut-off due to memory and computational time constraints. We apply the method to helium-hydrogen mixtures in the warm dense matter regime at temperatures of \textbackslashsim60\textbackslashtext\kK\ and find that the system displays two conductivity phases, where a transition from non-metal to metal occurs when hydrogen atoms constitute \textbackslashsim0.3 of the total atoms in the system.

cytter2019transition.pdf
Quantum Monte Carlo assessment of density functionals for π-electron molecules: ethylene and bifuran
Ospadov, E. ; Rothstein, S. M. ; Baer, R. Quantum Monte Carlo assessment of density functionals for π-electron molecules: ethylene and bifuran. Molecular Physics 2019, 117, 2241–2250. Publisher's VersionAbstract

We perform all-electron, pure-sampling quantum Monte Carlo (QMC) calculations on ethylene and bifuran molecules. The orbitals used for importance sampling with a single Slater determinant are generated from Hartree-Fock and density functional theory (DFT). Their fixed-node energy provides an upper bound to the exact energy. The best performing density functionals for ethylene are BP86 and M06, which account for 99% of the electron correlation energy. Sampling from the π-electron distribution with these orbitals yields a quadrupole moment comparable to coupled cluster CCSD(T) calculations. However, these, and all other density functionals, fail to agree with CCSD(T) while sampling from electron density in the plane of the molecule. For bifuran, as well as ethylene, a correlation is seen between the fixed-node energy and deviance of the QMC quadrupole moment estimates from those calculated by DFT. This suggests that proximity of DFT and QMC densities correlates with the quality of the exchange nodes of the DFT wave function for both systems.

ospadov2019quantum.pdf
Stochastic embedding DFT: Theory and application to p-nitroaniline in water
Li, W. ; Chen, M. ; Rabani, E. ; Baer, R. ; Neuhauser, D. Stochastic embedding DFT: Theory and application to p-nitroaniline in water. 2019, 151, 174115. Publisher's VersionAbstract

Over this past decade, we combined the idea of stochastic resolution of identity with a variety of electronic structure methods. In our stochastic Kohn-Sham density functional theory (DFT) method, the density is an average over multiple stochastic samples, with stochastic errors that decrease as the inverse square root of the number of sampling orbitals. Here, we develop a stochastic embedding density functional theory method (se-DFT) that selectively reduces the stochastic error (specifically on the forces) for a selected subsystem(s). The motivation, similar to that of other quantum embedding methods, is that for many systems of practical interest, the properties are often determined by only a small subsystem. In stochastic embedding DFT, two sets of orbitals are used: a deterministic one associated with the embedded subspace and the rest, which is described by a stochastic set. The method agrees exactly with deterministic calculations in the limit of a large number of stochastic samples. We apply se-DFT to study a p-nitroaniline molecule in water, where the statistical errors in the forces on the system (the p-nitroaniline molecule) are reduced by an order of magnitude compared with nonembedding stochastic DFT.

li2019stochastic.pdf
Nonmonotonic band gap evolution in bent phosphorene nanosheets
Vlček, V. ; Rabani, E. ; Baer, R. ; Neuhauser, D. Nonmonotonic band gap evolution in bent phosphorene nanosheets. Phys. Rev. Materials 2019, 3 064601. Publisher's VersionAbstract

Nonmonotonic bending-induced changes of fundamental band gaps and quasiparticle energies are observed for realistic nanoscale phosphorene nanosheets. Calculations using stochastic many-body perturbation theory show that even slight curvature causes significant changes in the electronic properties. For small bending radii (\textless4 nm) the band gap changes from direct to indirect. The response of phosphorene to deformation is strongly anisotropic (different for zigzag vs armchair bending) due to an interplay of exchange and correlation effects. Overall, our results show that fundamental band gaps of phosphorene sheets can be manipulated by as much as 0.7 eV depending on the bending direction.

vlcek_et_al._-_2019_-_nonmonotonic_band_gap_evolution_in_bent_phosphoren.pdf
Stochastic density functional theory
Fabian, M. D. ; Shpiro, B. ; Rabani, E. ; Neuhauser, D. ; Baer, R. Stochastic density functional theory. Wiley Interdisciplinary Reviews: Computational Molecular Science 2019, 10.1002/wcms.1412, e1412. Publisher's VersionAbstract

Linear-scaling implementations of density functional theory (DFT) reach their intended efficiency regime only when applied to systems having a physical size larger than the range of their Kohn–Sham density matrix (DM). This causes a problem since many types of large systems of interest have a rather broad DM range and are therefore not amenable to analysis using DFT methods. For this reason, the recently proposed stochastic DFT (sDFT), avoiding exhaustive DM evaluations, is emerging as an attractive alternative linear-scaling approach. This review develops a general formulation of sDFT in terms of a (non)orthogonal basis representation and offers an analysis of the statistical errors (SEs) involved in the calculation. Using a new Gaussian-type basis-set implementation of sDFT, applied to water clusters and silicon nanocrystals, it demonstrates and explains how the standard deviation and the bias depend on the sampling rate and the system size in various types of calculations. We also develop a basis-set embedded-fragments theory, demonstrating its utility for reducing the SEs for energy, density of states and nuclear force calculations. Finally, we discuss the algorithmic complexity of sDFT, showing it has CPU wall-time linear-scaling. The method parallelizes well over distributed processors with good scalability and therefore may find use in the upcoming exascale computing architectures. This article is categorized under: Electronic Structure Theory \textgreater Ab Initio Electronic Structure Methods Structure and Mechanism \textgreater Computational Materials Science Electronic Structure Theory \textgreater Density Functional Theory

fabian2019stochastic.pdf
Overlapped embedded fragment stochastic density functional theory for covalently-bonded materials
Chen, M. ; Baer, R. ; Neuhauser, D. ; Rabani, E. Overlapped embedded fragment stochastic density functional theory for covalently-bonded materials. J. Chem. Phys. 2019, 150, 034106. Publisher's VersionAbstract

The stochastic density functional theory (DFT) [R. Baer et al., Phys. Rev. Lett. 111, 106402 (2013)] is a valuable linear-scaling approach to Kohn-Sham DFT that does not rely on the sparsity of the density matrix. Linear (and often sub-linear) scaling is achieved by introducing a controlled statistical error in the density, energy, and forces. The statistical error (noise) is proportional to the inverse square root of the number of stochastic orbitals and thus decreases slowly; however, by dividing the system into fragments that are embedded stochastically, the statistical error can be reduced significantly. This has been shown to provide remarkable results for non-covalently-bonded systems; however, the application to covalently bonded systems had limited success, particularly for delocalized electrons. Here, we show that the statistical error in the density correlates with both the density and the density matrix of the system and propose a new fragmentation scheme that elegantly interpolates between overlapped fragments. We assess the performance of the approach for bulk silicon of varying supercell sizes (up to Ne = 16 384 electrons) and show that overlapped fragments reduce significantly the statistical noise even for systems with a delocalized density matrix.

chen2018overlapped.pdf

Pages