Feature/mrhs solvers #1489
Merged
Commits
114 commits
4951f11
Fix warning in spin taste and minor cleanup
maddyscientist 7c4793b
Some cleanup of CG interface
maddyscientist 9a2190c
Add MRHS interface for all solvers, and mandate source vector is const
maddyscientist fda4669
Optimize DiracWilson: vectorize the prepare/reconstruct functions
maddyscientist 44fb98a
Small cleanup to block_transpose.in.cu
maddyscientist fdd40fb
Add new parameter: QudaMultigridParam::n_vec_batch which is the batch…
maddyscientist fa64adf
Vectorize DiracCoarsePC prepare/reconstruct
maddyscientist fef58e8
Ensure we don't enable large arg support for pre Volta architecture
maddyscientist 8b8cd99
Create vector variants of create_alias
maddyscientist ee6fd26
Add some more scalar wrappers: this facilitates us making the vector …
maddyscientist ac23c73
Suppress annoying warning with Eigen
maddyscientist 2b0763b
Add default copy/move constructors/assignment operator for XUpdateBat…
maddyscientist 70a94df
Add some useful overloads to vector class to facilitate writing batch…
maddyscientist f792a33
Add explicit casting to double in anticipation of making the cast ope…
maddyscientist 57ba15e
First pass at enabling MRHS for CG, MR and SD solvers. To better fin…
maddyscientist 224bdb2
Accelerate MG::verify by using batch blas where applicable
maddyscientist 6256391
Fix bug in MRE solver
maddyscientist 7962dc3
Apply MRHS optimization to MRE solver
maddyscientist 075cfb8
Remove complex.h inclusion
maddyscientist 7cbab27
Vectorize all remaining Dirac prepare/reconstruct functions
maddyscientist d488607
Fix bug in GammaApply introduced in #1416
maddyscientist 6d1bafe
Fix issue with CG::hq_solve
maddyscientist d526544
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist 902d8ab
Fix bug with Clover Hasenbusch operator (wrong braces)
maddyscientist 02eecaa
Fix bug with DiracCoarsePC::reconstruct when using odd solve
maddyscientist 8b067d4
Fix bug with counting bytes with clover operator
maddyscientist c682cae
Default inner GCR solver to use L2 residual to enable early exit if p…
maddyscientist 1c5baef
Initial work to prepare for multi-rhs solver exposure: move the body …
maddyscientist faf4658
Fix flops counters for blas and reduce functions
maddyscientist b2f9849
Move remainder of invertQuda body into new MRHS solve wrapper that is…
maddyscientist c51f6e6
Fix true residual computation: QudaInvertParam::true_res and QudaInve…
maddyscientist c877352
invert_test and staggered_invert_test now respect --nsrc-tile flag fo…
maddyscientist 19d3348
Add some size checks to P and R
maddyscientist 3583ef8
Use batched blas in DiracCoarse
maddyscientist f64a9ac
Set verbosity in solve()
maddyscientist fa26c4e
GCR, CA-GCR and PreconditionedSolver are now MRHS aware
maddyscientist f3a3d8e
Multigrid solver is now MRHS aware
maddyscientist eefe8c7
Remove some legacy debug code from multigrid
maddyscientist f58bc3b
Add rescaling to coarse dslash with MMA - the code still needs cleanup.
hummingtree 958bf12
Add tensor core support for 32/64 MG coarsening
maddyscientist 21b4c08
Add striped signifier to packing kernel tune key
maddyscientist fce329d
Fix multi-RHS deflation
maddyscientist d9efb9c
Augmentation of state reporting to report the power, energy, temperat…
maddyscientist a781103
We should probably use MPI_THREAD_FUNNELED given we have threads now....
maddyscientist 8d3b59e
Report energy when running the solver now
maddyscientist 56ddf51
Fix Ampere+ mma kernels
maddyscientist 20d33dd
Fix staggered MG bug
maddyscientist 5f8f398
Clean up the coarse dslash MMA code:
hummingtree 1b220b7
cvector -> vector
maddyscientist ab3f667
Merge branch 'feature/mrhs-solvers' of github.com:lattice/quda into f…
hummingtree 7af1ade
Fix MPI bug
maddyscientist a20bf0e
Merge branch 'feature/mrhs-solvers' of github.com:lattice/quda into f…
maddyscientist 414260a
Fix deflateSVD for block deflation
maddyscientist 181a52e
Add some sanity checking when using split grid
maddyscientist 4447320
If communicator is not found, do not call errorQuda (which causes an …
maddyscientist 994bdd8
Fix some verbosity aspects of tuning
maddyscientist 49c6c8a
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist ceb84fa
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist 35e17db
Check set sizes match when copying between them
maddyscientist 661a2a1
Multi-RHS solvers should check to see if their state needs to be resized
maddyscientist 2dd2502
Add iterator-pair constructor for quda::vector class
maddyscientist 3ad7a57
MRHS optimizations for eigensolver: exposed new parameter QudaInvertP…
maddyscientist a386654
Preserve eigen space when running multi-src deflated solves
maddyscientist 2eb1289
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist 0ed1fe9
Fix CI warnings (one of which was a real bug)
maddyscientist 75a3b4c
More CI warnings
maddyscientist 37a6ae9
Fix some outstanding CI issues
maddyscientist fc0762d
renaming as suggested in CI
maddyscientist 44d8a2a
Use std::vector iterator constructor
maddyscientist e89be7d
Revert change made in this branch
maddyscientist dcd0d43
Cleanup of DiracCloverHasenbuschTwistPC
maddyscientist f0d9f3c
Clean up zero source checking in the solvers
maddyscientist 36c9b6d
CGNR and CGNE are now MRHS
maddyscientist d5c6708
CG3 is now MRHS
maddyscientist 2e52dad
Remove derived CGNR and CGNE specializations for CG/CA-CG/CG3: we now…
maddyscientist 0803993
Optimize HQ in CA-GCR
maddyscientist e1589e5
CA-CG is now MRHS
maddyscientist a3186a0
BiCGStab is now MRHS
maddyscientist cbf5943
BiCGStab(l) is now multi-RHS
maddyscientist dce23c6
Fix typo. Closes #1492
maddyscientist 827700d
Use fine-grain parallelization for CopySpinor
maddyscientist ade0e16
Ensure that mg_eig_evals_batch_size in test code has sensible default
maddyscientist 36ba139
Updated MILC interface to batched CG, hq tolerance bugfix in CG itself
weinbe2 50e7879
PCG is now multi-RHS ready. Improve robustness of Solver convergence…
maddyscientist c7f05b5
Fix default asan options which broke when the separate test library w…
maddyscientist 0073449
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist 2f3faab
Conditionally print energy information
maddyscientist 5af9d01
Reduce memory for clover force (use smaller halo for extended field)
maddyscientist 764fb78
Fix compiler warning
maddyscientist 7523f3a
Fix bug in CA CG
maddyscientist e46b436
Workarounds for NVSHMEM due to coarse-grained synchronization used i…
maddyscientist aa598b6
Merge branch 'develop' of github.com:lattice/quda into feature/mrhs-s…
maddyscientist 093e0e6
Split grid true residuals now correctly returned in QudaInvertParam s…
maddyscientist a3e889d
invert_test should never call invertMultiSrcQuda if multishift is int…
maddyscientist 9933e4d
Some MRHS HQ related solver fixes
maddyscientist da38153
More MRHS solver fixes
maddyscientist e5d74e4
Add QudaInvertParam::energy/power/temp/clock to check_param
maddyscientist 3bfcc68
Wilson ctest invert_test now uses multi-RHS
maddyscientist d3217ad
Fix memory freeing with chrono predictor
maddyscientist 1e28221
Fix CG3 for MRHS
maddyscientist 9936354
Fix clover force test
maddyscientist a0184d6
Heterogeneous reductions now break up the device-local partial read a…
maddyscientist e51c59c
ctest should use mrhs for asqtad solver test
maddyscientist 70a3b75
Fix typo
maddyscientist 9d4abe9
Fix for QudaMultigridParam::dslash_use_mma so that it respects the co…
maddyscientist a9ef50b
Apply clang format
maddyscientist 5c3192a
Updated the MILC HISQ MG interface for setting batch sizes
weinbe2 7903288
Set QudaMultigridParam::n_vec_batch to invalid to force user to set t…
maddyscientist e862afd
Merge branch 'feature/mrhs-solvers' of github.com:lattice/quda into f…
maddyscientist 7a5fb37
Made nvec_batch more robust in the MILC HISQ MG interface
weinbe2 4af762d
Merge branch 'feature/mrhs-solvers' of https://github.com/lattice/qud…
weinbe2 2d56bfd
bump CPM (silences some warnings with newer cmake)
mathiaswagner 05b2bc6
Fix typo
maddyscientist 4cef59f
Merge branch 'feature/mrhs-solvers' of github.com:lattice/quda into f…
maddyscientist