Exploiting decoupled discretization in meshless lattice Boltzmann method for high-Reynolds number flows
Dawid Strzelczyk∗, Maciej Matyka
Institute of Theoretical Physics, Faculty of Physics and Astronomy, University of Wrocław, pl. M. Borna 9, 50-204, Wrocław, Poland
∗ dawid.strzelczyk@uwr.edu.pl
Abstract One of the most severe limitations of the Lattice Boltzmann Method in the context of simulating inertial flows is the coupling of the discretization of space to the velocity discretization. It causes the need to increase the size of computational meshes whenever one wants to increase the Reynolds number in the system for a fixed velocity and viscosity. This work aims at adopting the recently proposed meshless LBM formulation to the problem of high Reynolds number flows by using its internal property of decoupled space and velocity discretizations. In meshless formulation, one can change the reference length of the problem by scaling the distances between points that discretize the domain while the streaming distance remains fixed. By doing that, we increase the Reynolds number at no cost: additional nodes need not be inserted in the discretization. We measure the accuracy and efficiency of this approach in the flow around a circular obstacle at –, the lid-driven cavity flow at –, and the flow through a porous sample at –. Additionally, we apply the meshless streaming step to the recently proposed fixed relaxation time LBM to extend its applicability to model inertial flows.
Keywords: Lattice Boltzmann Method, meshless methods, inertial flows, Reynolds number
1 Introduction
The transport of fluids through complex, porous structures is ubiquitous with many technological applications (CO2 storage, oil and shale gas extraction, electric battery design). It is of great interest for the study of human health, e.g. blood and cerebrospinal fluid flow, gas transport in the lungs[1, 2].
Experimental investigation of fluid flow phenomena in porous media is costly and requires sophisticated setups. Measuring velocity fields in creeping flows using Particle Image Velocimetry (PIV) methods has become popular [3, 4]. However, beyond Darcy Law (inertial regime) [5], PIV tracers used to track the flow face several problems. They may, for instance, damage the porous sample, and some of the flow regions may get excluded from the measurement due to the lack of tracers therein. Also, the tracers’ size must be chosen carefully to reconcile the accuracy of the scattered signal acquisition and the minimal interaction between the tracers, flow, and the pore space [6]. The remedy for this is to use numerical methods to solve fluid flow transport equations directly in pore space. Real samples, however, are relatively big compared to pore size, and using standard numerical methods based on computational grids may become impractical. This becomes most problematic at large Reynolds numbers, where fine flow structures appear.
Recently, the Lattice Boltzmann Method (LBM) became one of the most popular numerical tools used to solve the Navier-Stokes equations at the pore scale [7, 8]. It was applied to the simulation of fluid transport in various contexts of porous media [9], multiphase flows [10], semiclassical fluids flow [11], particulate suspensions [12] and especially inertial flows, and turbulent flows [13, 14]. Its popularity arises from the stable and local treatment of non-linear effects, a huge potential for parallelization of calculations, and simple implementation of basic no-slip boundary conditions on a regular grid [15, 16, 17]. The latter is especially attractive in complex porous media geometries, i.e. those arising from tomography scans in the form of binary images. Unfortunately, the memory overhead associated with storing the distribution function in LBM, and the need to regularly discretize the entire simulation space, limits the utility of this method for real-life, large samples with relatively small pore sizes.
One way to overcome the mentioned problem of excessive memory usage is to resort to LBM models with the fixed relaxation time (LBM1) [18, 19] where the distribution function is not explicitly stored in the memory. As shown in our previous work, LBM1 simplifies the main algorithm and reduces the memory requirement up to compared to the standard LBM solver [18]. Savings may also come from the use of the meshless discretization of space in LBM (meshless LBM, MLBM) [20]. This offers a significant advantage regarding the flexibility of space discretization, with LBM grid points distributed, i.e., in the pore-space only. We recently investigated its convergence in several scenarios, including low-Reynolds number flow through porous samples consisting of a network of two-dimensional channels and a three-dimensional periodic array of spheres [21].
By default, in both mentioned approaches, increasing the Reynolds number means either a decrease in viscosity, an increase in the flow velocity, or an increase in the size of the system. The first approach is limited by the minimum theoretical viscosity due to stability issues. Moreover, it becomes inaccessible in the formulation [18], where, by definition, we restrict the model to a fixed relaxation time, which involves fixing viscosity. Thus, considering the maximum velocity limit due to Mach number restrictions, increasing the Reynolds number by increasing the size of the system is the only way to obtain stable, high-Reynolds number fluid flow solutions in both MLBM and LBM1.
The aim of this work is to investigate the procedure of the domain reference length scaling in the meshless LBM (MLBM) to change the Reynolds number at zero computational and algorithmic cost [22] and to implement the meshless streaming step in the LBM1 model to achieve higher Reynolds numbers without additional memory overhead. We apply this scaling method to classic fluid flow problems: Kármán vortex street, lid-driven cavity and in the inertial regime of a flow through a porous medium.
2 Methods
2.1 Lattice Boltzmann Method
Lattice Boltzmann Method [15, 16] is a numerical tool for solving the discrete Boltzmann equation. During each timestep, two steps run - the streaming and the collision of the velocity distributions. The streaming is defined as:
(1) |
where is the -th distribution function with its streaming vector , denotes the vector antiparallel to () and the superscript ’post’ denotes the post-collision distribution function. According to this notation, denotes the departure node of the -th population being streamed to . Eq. (1) is written in a non-dimensionalized form with the timestep length and the lattice spacing equal to 1.
In this work, we use D2Q9 BGK model with 9 streaming directions ():
(2) |
Due to the Lagrangian nature of the streaming step, such a choice of the streaming vectors imposes the use of a square lattice (refer to Fig. 1 for a graphical representation of the LBM lattice and streaming directions). The collision step calculates from the current distributions and macroscopic fields. This work uses two relaxation time collision term [23]:
(3) |
where , are relaxation times for the symmetric and anti-symmetric parts of the distributions, which are defined as:
(4) |
The relaxation times are related to each other by:
(5) |
For the non-fixed relaxation time LBM and MLBM we use the value of [7]. The kinematic viscosity in the lattice units is then:
(6) |
where denotes the lattice speed of sound. The equilibrium distributions are expressed as:
(7) |
where is the weight specific to the -th streaming direction:
(8) |
The macroscopic density and velocity in lattice units, and respectively, at time and point , are obtained from the discrete populations:
(9) | ||||
The pressure is related to the density via the lattice speed of sound: .
In the numerical implementations, at each time step, Eq. (3) is first calculated to obtain the values of the post-collision distribution function (collision step). Then, (1) advects post-collision distributions to neighboring nodes (streaming step). Because lattice nodes coincide with the departure/arrival nodes of the streaming step, transport is purely Lagrangian and amounts to an index shift in the distribution function array.
The fixed-relaxation time variant of LBM assumes and [18, 19], which by Eq. (5) sets . In such case, Eq.(4) simplifies the streaming and collision procedures from Eqs. (1) and (3) to the same form as if Bhatnagar-Gross-Krook collision [24] was used with the unit relaxation time:
(10) |
This substitution alleviates the need to store the distribution function explicitly. In each timestep, the equilibrium distributions are calculated directly from the current macroscopic fields according to Eq. (7) and summed as in Eq. (9) to obtain the macroscopic fields in the next time step.
2.2 Formulation of meshless LBM
The meshless LBM algorithm considered in this work belongs to the family of off-grid LBM [25, 26, 27, 28, 20]. Those methods trade the Lagrangian approach to solving the streaming step for the semi-Lagrangian or Eulerian method, such that the space and velocity discretizations get decoupled, and nodes no longer need to be arranged in a regular grid. In the implementations taking the semi-Lagrangian approach to solve the streaming step, like the one discussed here, the discretization points (so-called Eulerian nodes) need not coincide with the departure points of the distribution function (so-called Lagrangian nodes). Instead, with each Eulerian point we identify Lagrangian points to which the corresponding distributions are interpolated and from which they are advected during the streaming step (see Fig. 1). To relate the Eulerian and Lagrangian nodes’ positions to one another, in MLBM the streaming vectors need to be expressed in physical units: where is the streaming distance. In this manner, the positions of the departure nodes of the -th population are related to the positions of their Eulerian nodes as . We introduce the interpolation step between the collision and the streaming to approximate the post-collision distributions in the appropriate Lagrangian points. Once the approximation is done, the streaming and the collision step are performed in the same way as in LBM (see. Fig. 2). We note that in the models with zero-velocity population discretized (i.e. in Eq. (2)), one does not perform the streaming for this population explicitly.
Among many interpolation methods available to solve the streaming step on irregular discretizations, meshless methods [29, 30, 31] are less computationally demanding and error-prone in the discretization process, they achieve boundary-compliant discretizations, offer high order approximations as well as feasible local refinement and adaptivity. Such approaches have been frequently used to solve transport partial differential equations [32, 33, 34, 35, 36, 37].
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x1.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x2.png)
The semi-Lagrangian streaming step in the discussed MLBM uses meshless interpolation based on radial basis functions-generated finite differences (RBF-FD)[29]. This allows one to approximate the value of a linear operator applied to the underlying function in an arbitrary point as a linear combination of the known function values stored in the neighboring nodes:
(11) |
where denotes the approximation point, denote the approximation stencil members and are interpolation weights. In particular one can speak of the unit operator () which results in the interpolation formula in Eq. (11). We use local approximation supports for each Lagrangian node consisting of its closest Eulerian neighbors, unless otherwise stated. To achieve high-order approximation we use polynomial augmentation [38] of order two, unless otherwise stated. For a detailed description of the algorithm, we refer the reader to our previous work [20].
The application of the above formalism to LBM1 is the following. The macroscopic density and velocities are interpolated to each of the Lagrangian points. Then, the equilibrium distributions are calculated in the Lagrangian nodes, Eq.(7), and their values are summed according to Eq.(9) to obtain the next-timestep macroscopic fields in the Eulerian nodes. The interpolation of the macroscopic fields, rather than the equilibrium distributions, frees one from the need to store in the memory.
2.3 Increasing Reynolds number in meshless LBM
To assess the ratio between inertial and viscous forces in a flow, the dimensionless Reynolds number is used:
(12) |
arising naturally from non-dimensionalization of incompressible Navier-Stokes momentum equation:
(13) |
where asterisk denotes non-dimensionalized quantities, is the fluid’s kinematic viscosity, is the acceleration, and , , denotes the reference length, velocity, and time, respectively. Conventionally, when inertial forces dominate the system, the Reynolds number is larger than unity . The practical utility of the Reynolds number lies in its ability to relate flows of different length and time scales to each other, e.g., flow past a real object versus its scaled model used in an experiment. The dependence of hydrodynamic coefficients, like drag, lift, or permeability, is often presented in relation to , thus making the results more universal. Reynolds number also allows to estimate the relative importance of non-linearity, turbulence, and momentum diffusion phenomena in a flow when choosing an appropriate physical model to describe the system.
In MLBM, the streaming distance length can be varied independently from the positions of the nodes discretizing the domain. It means that the whole space spanned by the streaming distance and the Eulerian discretization parameter is available in MLBM, in contrast to traveling along the line in the standard LBM (see Fig. 3). For instance, consider a flow domain of size . Let us assume that the streaming distance is (we do this by specifying the positions of Lagrangian points to differ from their Eulerian lattice centers by vectors). With such choice of and we arrive at the domain size in lattice units equal to , no matter what Eulerian discretization parameter we choose. Now, increasing the size of the domain , e.g., by multiplying the position vectors of the Eulerian points by some constant (with unaltered) changes the domain size in lattice units times. The same effect can be achieved by multiplying the streaming distance by and keeping the positions of Eulerian points constant (we use the notation to distinguish the factor for the scaling of the streaming distance from the one for the scaling of the Eulerian points positions). This is because the approximation coefficients from Eq. (11) are invariant under translation, rotation, and scaling of the stencil nodes and the interpolation point, and the presented MLBM algorithm uses the non-dimensionalized formulation of LBM ( and in lattice units).
Both strategies (depending on scaling the domain size in physical units or the streaming distance length) alter the flow’s Reynolds number in the following ways. Considering the Reynolds number calculated from the quantities in lattice units, both strategies change the size of the domain, changing the value from the previous paragraph. The same happens with calculated using the quantities in physical units when the scaling is used. Finally, when the scaling is applied to the streaming distance (i.e. ), to keep the physical viscosity unchanged, timestep length must scale with the square of , since . When this happens, the physical velocity changes into its counterpart in the scaled setup as
(14) |
which in the case of decreasing the streaming distance () means increasing the velocity in physical units.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x3.png)
In general, the scaling of the streaming distance is followed by the scaling of the conversion factors from lattice units to physical units:
(15) |
We note that the discussed method of increasing the Reynolds number in MLBM should apply when Eulerian schemes are used for solving the streaming step[39]. The scaled streaming distance would then be equivalent to the scaled streaming velocity in the term of the Boltzmann equation, where is the microscopic population velocity.
3 Results
3.1 Validation and execution time analysis
We investigate the flow over a cylindrical obstacle in a plane channel, Fig. 4. The length of the channel is , the height of the channel is and the cylindrical obstacle of diameter is placed on the channel’s axis, at a distance from its inlet.
For MLBM simulations we use the scattered discretization near the cylinder and in its wake and the regular discretization elsewhere. We use the meshless RBF-FD approximation for the Lagrangian points whose closest Eulerian neighbors do not form a regular lattice. In the remaining points we use biquadratic Lagrange interpolation. The distance between the nodes vary from on the cylinder’s surface to away from the cylinder. The set of points used for the discretization is symmetric about the channel’s axis to achieve a symmetric lift force profile in time. The boundary conditions on the obstacle’s surface is the no-slip wall. It is implemented using a simple bounceback rule for the populations for which the Lagrangian points lie inside the cylinder. The inlet and the outlet has the prescribed parabolic velocity profile with the maximum value of and unit density . Zero macroscopic velocity and unit density is imposed on the top and bottom boundaries. The former four boundaries are implemented by overwriting all populations in their nodes with the equilibrium distributions parametrized with the desired macroscopic density and velocity. The initial condition are the equilibrium populations parametrized with the unit density and zero macroscopic velocity .
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/kvs_points.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/kvs_points_zoom.png)
The Reynolds number of each case was calculated based on the inlet velocity and the cylinder’s diameter :
(16) |
The values considered in MLBM simulations are . We increase by decreasing the streaming distance length from for to for . The relaxation time is and the diffusive scaling of the timestep is used.
For the reference in timings analysis we perform a series of standard LBM simulations with the same initial and boundary conditions as in the MLBM setup. The Reynolds numbers considered in LBM simulations are . The lattice parameter for is and the increase in is achieved by the appropriate decrease in , such that yields the value . The relaxation time is which is close to the stability limit. Also here we use the diffusive scaling of the timestep. We note that MLBM setups use higher value of due to stability issues.
We investigate the onset of inertial effects in the flow qualitatively by observing the appearance of the Kármán vortex street in the obstacle’s wake and quantitatively via the increase of the Strouhal number :
(17) |
where is the frequency of the wake vortices shedding.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/CYLINDER_Re084.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/CYLINDER_Re168.png)
Fig. 5 shows velocity streamlines of the channel flow for two extreme Reynolds numbers: obtained with LBM and obtained with MLBM. The onset of inertial effects is visible as the distance between the repeating velocity field patterns in the obstacle’s wake decreases.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x4.png)
Fig. 6 shows the values of the Strouhal number against the Reynolds number of the flows. The present results are in good agreement with a fit to the experimental data for the cylinder’s diameter-to-length ratio equal to 2000 [40] and earlier experimental works [41, 42, 43].
Let us now move to the timings analysis. For this comparison, LBM and MLBM codes are executed single-threaded on 2x Intel Xeon E5520 with 12GB RAM machine. Fig. 7 shows the CPU time needed to simulate 1s of vortex shedding. For both methods, the execution time increases with . This is caused by the diffusive scaling of the timestep with the streaming distance and the decreasing value of with the increasing . This means that more timesteps is needed to reach 1s in the simulation for larger . Nevertheless, for MLBM the execution time rises slower than for LBM ( vs. ), since in the former it is only the timestep number that increases with . For LBM, on the other hand, not only the timestep number increases with , but also the total number of nodes in the domain does. For the highest value of considered here, MLBM performs times faster than LBM.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x5.png)
To analyze the sources of MLBM speedup, we consider the CPU time needed to perform iterations in a single-threaded run on the same machine as before, shown in Fig. 8. The data are presented for LBM and for three scenarios of MLBM. The first scenario uses fully scattered discretization with RBF-FD approximation at each Lagrangian node, with uniform in space internodal distance equal to . In the second scenario we exchange the scattered discretization away from the cylinder and its wake for a regular points arrangement. We also use biquadratic Lagrange interpolation for the Lagrangian nodes whose closest Eulerian neighbors form a regular lattice. The third scenario additionally features coarser discretization away from the cylinder and is the one used in the calculation of the Strouhal number. The number of nodes in the LBM and MLBM setups is summarized in Table 1. The presented execution time increases with for LBM because of the increasing number of nodes in the domain when the streaming distance is decreased. This is in contrast to MLBM, when the increase of is achieved only by decreasing the value of , with no additional nodes inserted in the domain. Due to this, the execution time stays constant for each scenario across the considered Reynolds numbers. Further, the execution times differ between the MLBM scenarios. About speedup between scenario 1 and 2 is achieved by switching from RBF-FD approximation with nodes to the computationally cheaper biquadratic Lagrange interpolation away from the cylinder. It is achieved in spite of the increase of the total number of nodes by about . Decreasing the discretization density away from the cylinder accounts for the subsequent speedup between scenario 2 and 3.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x6.png)
LBM case | |
---|---|
MLBM case | |
---|---|
scattered uniform | |
hybrid uniform | |
hybrid coarsened |
3.2 Application of MLBM1 in the lid-driven cavity
The flow domain is a square (Fig. 9). We use irregular discretizations with the Eulerian discretization parameter uniform in space. We set the top lid velocity to where is the reference velocity in LB units and is the truncated -coordinate. The no-slip boundary condition on the left, right, and bottom walls is implemented via multireflection bounceback [44]. The unknown populations at the top boundary are equilibrium distributions parametrized with and velocities: and . We introduce to be able to calculate streamed from or . To increase the Reynolds number, we scale the positions of Eulerian points by the factor of . Along with the streaming distance and physical viscosity , this gives Reynolds numbers based on the square side length and top lid velocity equal to and . We initialize the simulation with equilibrium distributions parametrized with zero velocity and unit density and iterate the simulation until the relative change of the -component of the velocity between two timesteps defined as:
(18) |
falls below or up to (for ) or (for ) iterations. By we mean the value of the -component of the velocity at the -th Eulerian node at the -th timestep.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/CAVITY_points_actual.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x7.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x8.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x9.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x10.png)
Figure 10 shows the normalized and component of velocity profiles along the centerlines of the cavity (, after rescaling the domain by ). Small circles denote MLBM1 results and are compared with the results of Ghia [45] denoted by larger squares. A good compliance between the two methods is visible, especially in the middle of the primary vortex. The difference in the velocity profiles increases closer to the boundaries, possibly due to large velocity gradients and shear stresses. Fig. 11 shows the centers of the primary vortex and the secondary vortices in the bottom left and bottom right corners of the cavity. The compliance between MLBM1 results and Ghia’s solution [45] is within about 1% of the domain side length. As the case exhibited an oscillatory behavior of slowly decreasing amplitude, we plot the path of the primary vortex and the secondary vortex in the bottom left corner of the cavity in Fig. 12. After iterations, corresponding to , the centers of the vortices oscillate around points close to those reported by Ghia [45]. We note that to obtain a stable solution in case, we needed to increase the order of augmentation in the meshless approximation to the fourth and the number of stencil members to . Despite the stability, spurious shockwaves propagating from the right boundary could be seen throughout the simulation.
3.3 Inertial effects in a flow through a porous sample
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/POROUS_points.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5712466/POROUS_points_detail.png)
We investigate the onset of inertial effects in a flow through a porous medium (see Fig. 13) of porosity with MLBM1. We use irregular discretization with the Eulerian discretization parameter . The no-slip boundary condition is imposed on the top and bottom boundary and the circular obstacles’ surfaces using the multireflection bounce back rule [44]. The periodic boundary condition is imposed on the left and right boundary by periodic search of the stencil members. The flow is forced by a constant acceleration of value along the direction. The meshless discretization of the domain corresponds to its size of . The investigated values of the scaling parameter range from to . We note that for , we could not obtain a stable solution despite increasing the stencil size to and using fourth-order polynomial augmentation. The streaming distance length is chosen to be , timestep length is and the kinematic viscosity is . The Reynolds number for each case was calculated based on the mean flow velocity in the -direction :
(19) |
sample’s side length equal to and kinematic viscosity .
We show the visualization of the velocity field for three chosen values of in Fig. 14. As the Reynolds number increases, recirculation zones grow causing the streamlines to separate from the grains’ wakes.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x11.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x12.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x13.png)
The inertial effects appearance can be quantified in terms of kinetic energy distribution in the system [5]. The left plot in Fig. 15 shows the fraction of the kinetic energy defined as confined in vortices to the total kinetic energy in the system:
(20) |
We use the Q-criterion to define a vortex as a volume where:
(21) |
and . By we mean the velocity components (i.e. and ). In the range of – a rapid increase in the value of the fraction from Eq. (20) occurs.
We can relate this to the transition in the behavior of the . In the Darcy regime (inertial effects negligible), is proportional to the pressure loss between the inlet and the outlet of the sample:
(22) |
where is the sample’s permeability, is the sample length, and the last equality holds if we relate the acceleration forcing the flow to the mean pressure drop via . Next, Eq. (22) needs to be related to the changes in the size of the sample since this is the way of increasing in our study. If in Eq. (13) we omit the pressure gradient term and change the reference length from to , we find that the only difference between the two systems (the one rescaled by and the non-rescaled one) is in the term . This means that a system scaled by a factor of behaves as if it was not scaled, but the viscosity was changed by a factor of (due to the term in the denominator of ). Inserting this relation into Eq. (22) gives:
(23) |
A similar procedure applied in the Forchheimer regime (leaving only the second order term ) yields:
(24) |
Presenting Eqs. (23) and (24) in logarithmic scale gives:
(25) |
which is visible in the right plot of Fig. 15. The transition between the second and the first order relations, denoted by dashed grey lines in Fig. 15, occurs at the scaling factor for which (see the inset in the left subplot of Fig. 15) and corresponds well with the predictions of the onset of inertial effects from the vortices’ kinetic energy approach.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x14.png)
![Refer to caption](https://cdn.statically.io/img/arxiv.org/x15.png)
Computational complexity and memory demands of the MLBM1 algorithm
Matyka and Dzikowski [18] derived the expression for the memory demands of their algorithm, which yields:
(26) |
where is the number of nodes in the discretization, is the number of macroscopic fields (in our case 3 - , and ), and is the number of bytes per single number (e.g. 8 for double). MLBM1 uses additional memory to store the interpolation weight vectors for each of the Lagrangian nodes (the term appears because no interpolation is needed for the populations with zero microscopic velocity). Assuming that each interpolation stencil has the same number of nodes , this results in the memory demands for MLBM1:
(27) |
where is the number of Eulerian nodes. Taking the ratio of the two gives:
(28) |
In our case, , , and for most of the presented cases, . Considering that sparser discretizations can be used in the meshless algorithm, it turns out that MLBM1 can outperform LBM1 regarding memory demands for the same streaming distance value. For example, in the driven cavity flow, we used , which would give the number of nodes in the LBM1 setup ranging from for to for . At the same time the term in brackets in Eq. (28) is equal to for and for (due to the change of for ). This means that for the MLBM1 domain with Eulerian nodes, the ratio ranges from times more memory in the case of MLBM1 for to times more memory in the case of LBM1 for . This shows the possibility of considerably reducing the number of nodes in LBM1 models with an off-grid (especially meshless) streaming procedure. When a proper local refinement strategy is used and less computationally intensive approximation scheme is employed e.g. in the bulk of the flow, as shown in Section 3.1, the memory savings may come at no cost to the solution’s accuracy.
The computational complexity of MLBM1 carries the overhead of interpolation during the streaming step. To exploit the absence of explicit storage of the distribution function in the memory, the collision (i.e., the calculation of ) must take place in Lagrangian nodes, and the macroscopic fields must be interpolated to those points. Otherwise (if the collision took place in Eulerian nodes), would have to be stored in memory between its calculation and interpolation. This means that for each Eulerian point, one needs operations per timestep, where is the number of operations needed to calculate and is the number of operations needed to interpolate a scalar within a stencil (note that for one does not need to interpolate, but still the calculation of is necessary). This shows that the savings in memory for distributions come at the cost of times more interpolations which have to be done for each Lagrangian node, compared to MLBM (in MLBM, one interpolates only one population to each Lagrangian point).
4 Discussion
The decoupled space and velocity discretizations in MLBM change the paradigm of increasing the model’s size typical for LBM and make it more similar to the one of the Navier-Stokes solvers. In LBM, the information about the characteristic size of the problem is encoded in the number of lattice links/nodes that discretize this size. So is the accuracy of the approximation in space. In MLBM, the information about the system’s size is stored in the ratio between the streaming distance and the distance between Eulerian points representing the characteristic length. The approximation accuracy in space depends on the distances between the approximation stencil members. It is invariant under their translation, rotation, and scaling by a constant as long as the approximated function is devoid of discontinuities. Consequently, in MLBM, the velocity discretization parameters are determined by the desired compressibility errors. In contrast, the space discretization parameters are determined by the flow domain’s geometry and the desired approximation accuracy in space. This is interpreted as hypervolumes in the space of those two groups of discretization parameters accessible to MLBM, where each point might be assigned its Reynolds number, in contrast to hypersurfaces in the case of LBM.
We chose the presented way of increasing the Reynolds number in MLBM to make the procedure as similar to the LBM approach as possible and highlight how the two methods relate to each other. This allows us to state the following. First, the qualitative results show that the described approach causes the Reynolds number to increase, in line with the findings of previous authors [22]. It also suggests that the propagation of information in the model based on the approximation, rather than the exact streaming, does not impair the physical mechanism of mass and momentum transfer in the discrete Boltzmann equation. Large streaming distances compared to the spacings between the Eulerian points correspond to fast momentum diffusion, small ratios - on the contrary. Second, quantitative analysis underlines the importance of the approximation accuracy for the solution. At the same time, the presented discretization parameters may serve as a crude estimation of the method’s stability limits along with the aids to mitigate the divergence.
The accuracy and convergence of MLBM and other off-grid LBM methods were investigated in numerous works [27, 46, 20], especially in application to inertial flows in works by Lin and others [47] or Musavi and Ashrafizaadeh [48, 28]. According to the findings by, e.g., Krämer and others [46], the decreasing ratio of introduces larger approximation errors in OLBM. When semi-Lagrangian streaming is used, this has directly to do with the error term characteristic for semi-Lagrangian methods, as suggested in our previous work [20]. To prevent the divergence of this term, for the growing , one would need to either increase , which would again decrease the size of the domain, or increase the accuracy of the approximation. We presented this procedure in two tests. First, we used regular space discretization in the far-field subdomain of the flow past a cylinder, in contrast to an irregular discretization around the cylinder. Second, in the driven cavity test for the highest Reynolds number (), the augmentation order and the number of stencil members were increased compared to the cases with lower to mitigate the divergence of the simulation. However, the mentioned flow through a porous medium for shows that increasing the approximation order or the stencil size does not always guarantee stability retrieval. In such cases, the Eulerian discretization refinement should be used. On the other hand, from the LBM error point of view, decreasing the ratio acts in favor of the compressibility error, which would increase when higher would be achieved by larger inlet velocity or acceleration. Finally, the error of the solution obtained with MLBM on rescaled domains may also come from the under-resolution of physical phenomena, such as small vortices typical for inertial flows. However, this under-resolution might be exploited to the method’s advantage, as suggested by Chen [49].
The solution’s accuracy considerations must always be counterbalanced by preserving the simulation’s efficiency and robustness. In MLBM, one can look for memory and computational savings in discretizing space and velocity independently. First, the feasibility of the local static and dynamic refinement in meshless approximation methods offers great promise to reduce the number of Eulerian points in MLBM compared to LBM. Apart from an arbitrary refinement, as shown in the present work in the case of the flow around a cylinder, the choice of a proper error indicator, such as quantities based on vorticity suggested by Fakhari and Lee [50] or on the velocity field divergence [15], can allow for dynamic - or -refinement during the simulation run. In turn, the error of the solution becomes more uniform in space granting a better accuracy at a little increase of the computational expense. Second, switching to regular discretizations with the approximation schemes computationally less intensive than the scattered ones, can considerably decrease the computational burden in the parts of the domain devoid of geometrical intricacies. The research on this approach is given attention in the field of meshless analysis [51, 52, 53]. Finally, LBM1 models [18, 19] reduce the memory overhead by explicitly storing just a few macroscopic fields rather than a large number of velocity distributions. We show that by introducing meshless space discretization, it is possible to achieve a several-fold memory advantage in comparison to the lattice-based LBM1, which is already estimated to use about less memory than its arbitrary relaxation time counterpart [18].
5 Conclusions
We investigate the procedure to increase the Reynolds number of a flow from laminar to the transitional regime in meshless LBM simulations by increasing the system’s domain size. We achieve this solely by multiplying the coordinates of Eulerian points by some constant , or equivalently - the streaming distance by its inverse , thus no additional nodes are inserted into the domain. We apply this method of the Reynolds number increase to a general meshless LBM scheme and to the meshless implementation of a memory-efficient LBM model with the relaxation time (LBM1). We show that the procedure works by numerically investigating three test cases in a range of Reynolds numbers - von Kârmn̂ vortex street behind a cylindrical obstacle of infinite length, driven cavity flow, and a flow through a porous sample. The qualitative observation of the velocity fields indicates the onset of inertial effects in the flows. Also, the obtained values of hydrodynamic parameters agree well with the numerical and experimental references in a wide range of Reynolds numbers. We highlight the computational speedup of MLBM compared to the standard LBM when a proper choice of the nodes arrangement and approximation scheme is made. From the performed simulations, we estimate the possible memory savings of the meshless LBM1 compared to the standard (lattice-based) LBM1 to be as high as 32-fold and pinpoint the source of the computational overhead of the meshless LBM1.
Our findings open new perspectives for further studies on the development and applications of MLBM, and its competitiveness with LBM. Questions yet to be answered address the issues of the possibility of simultaneous scaling of space and velocity discretizations, or the detailed mechanism of the information propagation when the non-purely Lagrangian streaming step is used.
6 Acknowledgments
Funded by National Science Centre, Poland under the OPUS call in the Weave programme 2021/43/I/ST3/00228. This research was funded in whole or in part by National Science Centre (2021/43/I/ST3/00228). For the purpose of Open Access, the author has applied a CC-BY public copyright licence to any Author Accepted Manuscript (AAM) version arising from this submission.
References
- [1] D. N. Ku, Blood flow in arteries, Annual review of fluid mechanics 29 (1) (1997) 399–434.
- [2] D. H. Kelley, J. H. Thomas, Cerebrospinal fluid flow, Annual Review of Fluid Mechanics 55 (2023) 237–264.
- [3] M. R. Morad, A. Khalili, Transition layer thickness in a fluid-porous medium of multi-sized spherical beads, Experiments in Fluids 46 (2) (2009) 323–330.
- [4] M. Souzy, H. Lhuissier, Y. Méheust, T. Le Borgne, B. Metzger, Velocity distributions, dispersion and stretching in three-dimensional porous media, Journal of Fluid Mechanics 891 (2020) A16.
-
[5]
J. S. Andrade, U. M. S. Costa, M. P. Almeida, H. A. Makse, H. E. Stanley,
Inertial effects
on fluid flow through disordered porous media, Phys. Rev. Lett. 82 (1999)
5249–5252.
doi:10.1103/PhysRevLett.82.5249.
URL https://link.aps.org/doi/10.1103/PhysRevLett.82.5249 -
[6]
R. Sánchez-González, S. W. North,
Chapter
18 - nitric oxide laser-induced fluorescence imaging methods and their
application to study high-speed flows, in: J. Laane (Ed.), Frontiers and
Advances in Molecular Spectroscopy, Elsevier, 2018, pp. 599–630.
doi:https://doi.org/10.1016/B978-0-12-811220-5.00019-8.
URL https://www.sciencedirect.com/science/article/pii/B9780128112205000198 - [7] C. Pan, L.-S. Luo, C. T. Miller, An evaluation of lattice boltzmann schemes for porous medium flow simulation, Computers & fluids 35 (8-9) (2006) 898–909.
- [8] C. Gao, R.-N. Xu, P.-X. Jiang, Pore-scale numerical investigations of fluid flow in porous media using lattice boltzmann method, International Journal of Numerical Methods for Heat & Fluid Flow 25 (8) (2015) 1957–1977.
- [9] M. Matyka, Z. Koza, How to Calculate Tortuosity Easily?, AIP Conference Proceedings 1453 (1) (2011) 17–22. doi:10.1063/1.4711147.
-
[10]
G. Falcucci, E. Jannelli, S. Ubertini, S. Succi,
Direct
numerical evidence of stress-induced cavitation, Journal of Fluid Mechanics
728 (2013) 362–375.
doi:10.1017/jfm.2013.271.
URL https://www.cambridge.org/core/product/identifier/S0022112013002711/type/journal_article - [11] R. C. Coelho, M. M. Doria, Lattice Boltzmann Method for Semiclassical Fluids, Computers and Fluids 165 (October 2017) (2018) 144–159. doi:10.1016/j.compfluid.2018.01.019.
- [12] A. J. Ladd, Numerical Simulations of Particulate Suspensions Via a Discretized Boltzmann Equation. Part 1. Theoretical Foundation, Journal of Fluid Mechanics 271 (1994) 285–309. doi:10.1017/S0022112094001771.
-
[13]
X. Sun, C. Zhang, X. Wang,
A
DNS investigation by LBM: Acoustic characteristics of a flow
around rod-hydrofoil configuration at different angles of attack, Ocean
Engineering 266 (2022) 112779.
doi:10.1016/j.oceaneng.2022.112779.
URL https://linkinghub.elsevier.com/retrieve/pii/S0029801822020625 -
[14]
L. A. Hegele, A. Scagliarini, M. Sbragaglia, K. K. Mattila, P. C. Philippi,
D. F. Puleri, J. Gounley, A. Randles,
High-Reynolds-number
turbulent cavity flow using the lattice Boltzmann method, Physical
Review E 98 (4) (2018) 043302.
doi:10.1103/PhysRevE.98.043302.
URL https://link.aps.org/doi/10.1103/PhysRevE.98.043302 - [15] S. Succi, The Lattice Boltzmann Equation: For Complex States of Flowing Matter, Oxford University Press, 2018. doi:10.1093/oso/9780199592357.001.0001.
- [16] T. Krüger, H. Kusumaatmaja, A. Kuzmin, O. Shardt, G. Silva, E. M. Viggen, The Lattice Boltzmann Method: Principles and Practice, Graduate Texts in Physics, Springer International Publishing, Cham, 2017. doi:10.1007/978-3-319-44649-3.
- [17] Z. Guo, C. Shu, Lattice Boltzmann Method: And Its Applications in Engineering, no. vol. 3 in Advances in Computational Fluid Dynamics, World Scientific, Singapore, 2013.
-
[18]
M. Matyka, M. Dzikowski,
Memory-efficient
lattice boltzmann method for low reynolds number flows, Computer Physics
Communications 267 (2021) 108044.
doi:https://doi.org/10.1016/j.cpc.2021.108044.
URL https://www.sciencedirect.com/science/article/pii/S0010465521001569 -
[19]
J. G. Zhou, Macroscopic Lattice
Boltzmann Method, Water 13 (1) (2020) 61.
doi:10.3390/w13010061.
URL https://www.mdpi.com/2073-4441/13/1/61 -
[20]
D. Strzelczyk, M. Matyka,
Study
of the convergence of the meshless lattice boltzmann method in
taylor–green, annular channel and a porous medium flows, Computers and
Fluids 269 (2024) 106122.
doi:https://doi.org/10.1016/j.compfluid.2023.106122.
URL https://www.sciencedirect.com/science/article/pii/S004579302300347X - [21] D. Strzelczyk, M. Rot, M. Matyka, G. Kosec, On meshless solution to navier-stokes problem in porous media: comparing meshless lattice boltzman method with acm rbf-fd approach (in preparation) (2024).
- [22] X. He, L.-S. Luo, M. Dembo, Some progress in the lattice boltzmann method: Reynolds number enhancement in simulations, Physica A: Statistical Mechanics and its Applications 239 (1-3) (1997) 276–285.
- [23] I. Ginzburg, F. Verhaeghe, D. d’Humières, Two-Relaxation-Time Lattice Boltzmann Scheme: About Parametrization, Velocity, Pressure and Mixed Boundary Conditions, Communications in Computational Physics.
- [24] P. L. Bhatnagar, E. P. Gross, M. Krook, A Model for Collision Processes in Gases. I. Small Amplitude Processes in Charged and Neutral One-Component Systems, Phys. Rev. 94 (3) (1954) 511–525. doi:10.1103/PhysRev.94.511.
- [25] X. He, L. S. Luo, M. Dembo, Some Progress in Lattice Boltzmann Method. Part I. Nonuniform Mesh Grids, Journal of Computational Physics 129 (2) (1996) 357–363. doi:10.1006/jcph.1996.0255.
-
[26]
S. Ubertini, G. Bella, S. Succi,
Lattice
Boltzmann method on unstructured grids: Further developments,
Physical Review E 68 (1) (2003) 016701.
doi:10.1103/PhysRevE.68.016701.
URL https://link.aps.org/doi/10.1103/PhysRevE.68.016701 - [27] A. Krämer, K. Küllmer, D. Reith, W. Joppich, H. Foysi, Semi-Lagrangian off-Lattice Boltzmann Method for Weakly Compressible Flows, Physical Review E 95 (2) (2017) 1–12. doi:10.1103/PhysRevE.95.023305.
-
[28]
S. H. Musavi, M. Ashrafizaadeh,
A
mesh-free lattice Boltzmann solver for flows in complex geometries,
International Journal of Heat and Fluid Flow 59 (2016) 10–19.
doi:10.1016/j.ijheatfluidflow.2016.01.006.
URL https://linkinghub.elsevier.com/retrieve/pii/S0142727X16300030 - [29] M. Buhmann, Bengt Fornberg, Natasha Flyer: “A Primer on Radial Basis Functions with Applications to the Geosciences”, Jahresbericht der Deutschen Mathematiker-Vereinigung 119 (1) (2017) 53–58. doi:10.1365/s13291-016-0149-y.
- [30] W. Chen, Z. J. Fu, C. S. Chen, Recent Advances in Radial Basis Function Collocation Methods, no. 9783642395710, 2014. doi:10.1007/978-3-642-39572-7.
- [31] G. S. Bhatia, G. Arora, Radial Basis Function Methods for Solving Partial Differential Equations-A Review, Indian Journal of Science and Technology 9 (45). doi:10.17485/ijst/2016/v9i45/105079.
- [32] N. Flyer, G. B. Wright, Transport Schemes on a Sphere Using Radial Basis Functions, Journal of Computational Physics 226 (1) (2007) 1059–1084. doi:10.1016/j.jcp.2007.05.009.
- [33] V. Shankar, G. B. Wright, R. M. Kirby, A. L. Fogelson, Augmenting the Immersed Boundary Method with Radial Basis Functions (RBFs) for the Modeling of Platelets in Hemodynamic Flows, International Journal for Numerical Methods in Fluids 79 (10) (2015) 536–557. doi:10.1002/fld.4061.
- [34] V. Shankar, G. B. Wright, R. M. Kirby, A. L. Fogelson, A Radial Basis Function (RBF)-Finite Difference (FD) Method for Diffusion and Reaction–Diffusion Equations on Surfaces, Journal of Scientific Computing 63 (3) (2015) 745–768. doi:10.1007/s10915-014-9914-1.
- [35] G. Kosec, B. Šarler, Solution of Thermo-Fluid Problems by Collocation with Local Pressure Correction, International Journal of Numerical Methods for Heat and Fluid Flow 18 (7-8) (2008) 868–882. doi:10.1108/09615530810898999.
- [36] G. Kosec, A Local Numerical Solution of a Fluid-Flow Problem on an Irregular Domain, Advances in Engineering Software 120 (2016) 36–44. doi:10.1016/j.advengsoft.2016.05.010.
- [37] J. Slak, G. Kosec, Adaptive Radial Basis Function– Generated Finite Differences Method for Contact Problems, International Journal for Numerical Methods in Engineering 119 (7) (2019) 661–686. doi:10.1002/nme.6067.
- [38] N. Flyer, B. Fornberg, V. Bayona, G. A. Barnett, On the Role of Polynomials in RBF-FD Approximations: I. Interpolation and Accuracy, Journal of Computational Physics 321 (2016) 21–38. doi:10.1016/j.jcp.2016.05.026.
- [39] T. Lee, C. L. Lin, An Eulerian Description of the Streaming Process in the Lattice Boltzmann Equation, Journal of Computational Physics 185 (2) (2003) 445–471. doi:10.1016/S0021-9991(02)00065-7.
-
[40]
C. Norberg,
An
experimental investigation of the flow around a circular cylinder: Influence
of aspect ratio, Journal of Fluid Mechanics 258 (1994) 287–316.
doi:10.1017/S0022112094003332.
URL https://www.cambridge.org/core/product/identifier/S0022112094003332/type/journal_article - [41] J. Gerrard, The wakes of cylindrical bluff bodies at low Reynolds number, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences 288 (1354) (1978) 351–382.
- [42] E. Berger, The determination of the hydrodynamic parameters of a karman vortex street from hot wire measurements at low reynolds number, Z. Flugwiss 12 (41) (1964) 41–59.
- [43] E. Berger, Transition of the laminar vortex flow to the turbulent state of the Karman vortex street behind an oscillating cylinder at low Reynolds number, Jahrb. Wiss. Gess. LR 164.
-
[44]
I. Ginzburg, D. d’Humières,
Multireflection
boundary conditions for lattice boltzmann models, Phys. Rev. E 68 (2003)
066614.
doi:10.1103/PhysRevE.68.066614.
URL https://link.aps.org/doi/10.1103/PhysRevE.68.066614 - [45] U. Ghia, K. N. Ghia, C. T. Shin, High-Re Solutions for Incompressible Flow Using the Navier-Stokes Equations and a Multigrid Method, Journal of Computational Physics 48 (3) (1982) 387–411. doi:10.1016/0021-9991(82)90058-4.
-
[46]
A. Krämer, D. Wilde, K. Küllmer, D. Reith, H. Foysi, W. Joppich,
Lattice
Boltzmann simulations on irregular grids: Introduction of the
NATriuM library, Computers & Mathematics with Applications 79 (1)
(2020) 34–54.
doi:10.1016/j.camwa.2018.10.041.
URL https://linkinghub.elsevier.com/retrieve/pii/S0898122118306382 - [47] X. Lin, J. Wu, T. Zhang, A Mesh-Free Radial Basis Function– Based Semi-Lagrangian Lattice Boltzmann Method for Incompressible Flows, International Journal for Numerical Methods in Fluids 91 (4) (2019) 198–211. doi:10.1002/fld.4749.
-
[48]
S. H. Musavi, M. Ashrafizaadeh,
Development
of a three dimensional meshless numerical method for the solution of the
Boltzmann equation on complex geometries, Computers & Fluids 181 (2019)
236–247.
doi:10.1016/j.compfluid.2019.01.021.
URL https://linkinghub.elsevier.com/retrieve/pii/S0045793019300209 -
[49]
H. Chen, Volumetric
formulation of the lattice Boltzmann method for fluid dynamics: Basic
concept, Physical Review E 58 (3) (1998) 3955–3963.
doi:10.1103/PhysRevE.58.3955.
URL https://link.aps.org/doi/10.1103/PhysRevE.58.3955 -
[50]
A. Fakhari, T. Lee,
Finite-difference
lattice Boltzmann method with a block-structured adaptive-mesh-refinement
technique, Physical Review E 89 (3) (2014) 033310.
doi:10.1103/PhysRevE.89.033310.
URL https://link.aps.org/doi/10.1103/PhysRevE.89.033310 -
[51]
M. Rot, M. Jančič, G. Kosec,
Spatially
dependent node regularity in meshless approximation of partial differential
equations, Journal of Computational Science 79 (2024) 102306.
doi:10.1016/j.jocs.2024.102306.
URL https://linkinghub.elsevier.com/retrieve/pii/S1877750324000991 - [52] A. Javed, K. Djidjeli, J. T. Xing, S. J. Cox, A Hybrid Mesh Free Local RBF- Cartesian FD Scheme for Incompressible Flow around Solid Bodies 7 (10).
-
[53]
H. Ding, C. Shu, K. Yeo, D. Xu,
Simulation
of incompressible viscous flows past a circular cylinder by hybrid FD
scheme and meshless least square-based finite difference method, Computer
Methods in Applied Mechanics and Engineering 193 (9-11) (2004) 727–744.
doi:10.1016/j.cma.2003.11.002.
URL https://linkinghub.elsevier.com/retrieve/pii/S0045782503005838