Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1922 commits
Select commit Hold shift + click to select a range
f144774
[Dynamo] Fix llvm target for meta schedule & add torch to tvm ndarray…
Dec 7, 2022
91b4a82
[inductor] Use a large tolerance for botnet26t_256 (#90383)
desertfire Dec 7, 2022
470cb1c
[FSDP][optim_state_dict][2/N] Add _get_fqn_to_fsdp_param_info to map …
fegin Dec 7, 2022
131256e
Revert "[pruning][core][feature] Implement prune for structured pruni…
pytorchmergebot Dec 7, 2022
e42286d
Fix some typed storage is deprecated warnings. (#89867)
ezyang Dec 7, 2022
6b3d09a
Fix AssertionError fake_mode is not None in distributed (#90392)
ezyang Dec 7, 2022
51ccda6
[fx] Copy codegen in legalize_graph (#90023)
angelayi Dec 7, 2022
8ec0772
[Vulkan] Enable copying QInt8 and QInt32 tensors from cpu to vulkan. …
manuelcandales Dec 7, 2022
9114478
[FSDP][Easy] ufmt files (#90384)
Dec 7, 2022
412e6e5
Automated submodule update: FBGEMM (#74729)
facebook-github-bot Dec 7, 2022
ca1c020
Add manual cuda deps search logic (#90411)
malfet Dec 7, 2022
f0ded45
[Vulkan][TCC] Add tests for quantized convolution with QUInt8 activat…
manuelcandales Dec 7, 2022
ab4d2a6
Fix C2 Ambiguous namespace (#89534)
xw285cornell Dec 7, 2022
01c4698
correctly set strides for expanded/unsqueezed dimensions (#90341)
Dec 7, 2022
27adb7a
add save and load stats in memory_tracker (#90144)
zhaojuanmao Dec 8, 2022
3bbc35d
[primTorch] Add prim and ref for as_strided_scatter (#88426)
peterbell10 Dec 7, 2022
d751cde
Kineto activity fix (#89785)
mwootton Dec 8, 2022
9d0c8e8
C++17 friendly iterator implementation (#90379)
malfet Dec 8, 2022
4dc11af
[inductor] Remove .to from lowering (#90280)
desertfire Dec 7, 2022
6a3384c
Revert "[Quant][fx][bc-breaking] Make convert.py smaller (#90189)"
pytorchmergebot Dec 8, 2022
97a43a0
[quant] Add support for symmetric quant in executorch (#90304)
jerryzh168 Dec 6, 2022
6bf2cf1
Add missing mypy-nofollow.ini (#90179)
ezyang Dec 7, 2022
c473284
Type torch._dynamo.symbolic_convert (#90185)
ezyang Dec 7, 2022
7fc064e
Convert InstructionTranslatorGraphState and OutputGraphState to Named…
ezyang Dec 7, 2022
064724e
Type torch._dynamo.side_effects (#90202)
ezyang Dec 7, 2022
cb15e39
Rewrite dynamo cond() handling to not recursively call export (#90286)
ezyang Dec 7, 2022
8b880fd
Support pickle version 4 by adding missing ops (#90223)
atuljangra Dec 8, 2022
10075d4
Fix issue 38095 TODOs in gloo tests (#89985)
kit1980 Dec 8, 2022
28932f5
Introduce CUDA Device Assertions Infrastructure (#84609)
r-barnes Dec 8, 2022
7bfeffe
Fix issue 38095 TODO in test_jit_fuser_te.py (#90246)
kit1980 Dec 8, 2022
cbc8d58
Revert "Disable dynamo tracing torchrec.distributed (#90087)" (#90416)
wconstab Dec 7, 2022
9099726
[pthreadpool] Set max threadlimit to tsan limit (#89453)
digantdesai Dec 8, 2022
293ade7
Migrate PyTorch to C++17 (#85969)
malfet Dec 8, 2022
27951ad
[Checkpoint][2D][6/N] Add optimizer and update default_planner to cor…
wz337 Dec 8, 2022
9e39686
Always compile tiny graphs with AOTAutograd (#89775)
ezyang Dec 7, 2022
83c7fbd
Clean up dependancy for flatbuffer_loader (#86041)
qihqi Dec 8, 2022
d625385
Revert "[Inductor] More robust stride and offset extraction from inde…
pytorchmergebot Dec 8, 2022
fa33af8
Revert "add save and load stats in memory_tracker (#90144)"
pytorchmergebot Dec 8, 2022
4241615
[ONNX] Fix ignored small eps in layer normalization in fp16 (#89869)
titaiwangms Dec 7, 2022
9f4c6bf
Fix issue 38095 TODO in test_multiprocessing.py (#90335)
kit1980 Dec 8, 2022
7ce90d1
[PT-D][Easy] Reformat the optim code within PTD code base (#90399)
fduwjj Dec 8, 2022
a112f79
[LTC] Tweak LazyGraphExecutor for XLA (#90420)
alanwaketan Dec 8, 2022
454459d
[pruning][core][feature] Implement prune for structured pruning (#89777)
jcaip Dec 7, 2022
8fd4c1a
[Resubmit] state_dict_pre_hook (#90435)
rohan-varma Dec 8, 2022
7d01f8a
[LTC] Overlap data creation and ir_value setting (#90438)
alanwaketan Dec 8, 2022
14f2f98
Avoid overflow in tensorboard image summary (#90423)
ppwwyyxx Dec 8, 2022
ffc7f50
Remove deprecated call to tf.io.gfile.get_filesystem (#89832)
ppwwyyxx Dec 8, 2022
e248f08
Add a transform for positive-definite matrices. (#76777)
tillahoffmann Dec 8, 2022
418bdda
Upgrade oneDNN to v2.7.2 (#90051)
yanbing-j Dec 8, 2022
bf21b97
[inductor] Use decomposition for _to_copy (#90314)
desertfire Dec 8, 2022
a0905d9
[Composable API] Move test models to common file (#90385)
Dec 7, 2022
433f1cb
[Composable API] Refactor `test_fully_shard.py` to use common models …
Dec 7, 2022
538e478
[Composable API] Match `fully_shard()` comm. schedule with wrapper FS…
Dec 7, 2022
676c1ad
[Composable API][Easy] Use `policy=None` since that is supported (#90…
Dec 7, 2022
6df2e47
Implement correction argument in torch.masked.{std,var} (#87118)
peterbell10 Dec 7, 2022
eb18d90
Hybrid Sharded Data Parallel (#89915)
rohan-varma Dec 8, 2022
0802ef8
Remove TORCH_API from inline at::internal::lazy_init_num_thread (#89511)
ankurvdev Dec 8, 2022
8ded2f1
[inductor] Fallback for index with None in the middle of indices (#90…
desertfire Dec 7, 2022
a870dfb
Add support to foreach torch empty for bfloat16s (#90437)
drisspg Dec 8, 2022
456508d
Revert "Fix issue 38095 TODO in test_multiprocessing.py (#90335)"
pytorchmergebot Dec 8, 2022
3a55b5a
Add additional tests for view slice tensors (#86282)
DenisVieriu97 Dec 8, 2022
322cca2
[LTC] Tweak LazyTensor Class for XLATensor (#90363)
alanwaketan Dec 8, 2022
ba74555
Revert "[inductor] Use decomposition for _to_copy (#90314)"
pytorchmergebot Dec 8, 2022
924cd1c
skip flaky tests (rather than expectedFailure) (#90233)
ppwwyyxx Dec 8, 2022
a123579
Add feature flag for the autograd.Function extension (#89858)
zou3519 Dec 7, 2022
3cc6ba9
Add setup_context staticmethod to autograd.Function (#89859)
zou3519 Dec 7, 2022
a478efc
functorch.grad support for autograd.Function (#89860)
zou3519 Dec 7, 2022
74283c0
Refactor test_inductor_XXX to reduce code duplication (#90443)
ezyang Dec 8, 2022
376b30c
Add Pointwise Tag from pointwise set in DTensor, use in aot_autograd …
eellison Dec 7, 2022
83545cf
avoid fork in torch/__init__.py for deploy/multipy (#90492)
Dec 8, 2022
08e4b56
[rfc][pkg] check spec for module source before falling back to file i…
smacke Dec 8, 2022
a1b6433
[dynamo] Rewrite inplace addcdiv and inplace add (#90330)
mlazos Dec 8, 2022
dda4362
Track torch.compile calls (#90310)
williamwen42 Dec 8, 2022
8f685d8
Don't update vision hash on push (#90498)
clee2000 Dec 8, 2022
98ab4c9
Fix issue 38095 TODOs in test_quantized_tensor.py (#90344)
kit1980 Dec 8, 2022
d2c3082
[Ez] Omit HSDP Z2 from doc (#90503)
rohan-varma Dec 8, 2022
c2e2722
[20/N] Add recv_any_source custom op with CPU/CUDA implementations (#…
H-Huang Dec 7, 2022
6c735e7
[21/N] Add alltoall_base custom op with CPU/CUDA implementations (#89…
H-Huang Dec 7, 2022
57174c7
Deepcopy GraphModule in minifier (#90401)
williamwen42 Dec 8, 2022
10d686e
[Composable API][Easy] Fix some follow-ups (#90471)
Dec 8, 2022
a655da7
Fix bug in dynamic shapes multiply (#90336)
tugsbayasgalan Dec 7, 2022
f790592
Fix static initialization issue for static build (#90133)
ppwwyyxx Dec 9, 2022
672e05f
Bump version: 1.14.0->2.0.0 (#90491)
malfet Dec 9, 2022
a2b0e24
[quant][be] Remove special casing for getitem in prepare (#90393)
jerryzh168 Dec 7, 2022
a0e5322
[dynamo][ez] Change module type to str for easier downstream parsing …
jerryzh168 Dec 8, 2022
b25267c
[ONNX] Extend PR approver list (#90490)
BowenBao Dec 9, 2022
5931367
Fixed a couple of mistakes in type annotations in optim package (#90216)
mauvilsa Dec 9, 2022
20bb5d0
[vision hash update] update the pinned vision hash (#90513)
pytorchmergebot Dec 9, 2022
2c43abc
Dynamo, FX, Inductor Progress Bars (#88384)
msaroufim Dec 9, 2022
47c48bc
[threaded PG] fix long hang issue in testing (#90515)
XilunWu Dec 9, 2022
12e7c66
[ao] backend_config moving all to top (#88391)
HDCharles Dec 8, 2022
1916d2d
[ao] public vs private for ao.quantization._X (#88392)
HDCharles Dec 8, 2022
337638c
Implement checks for vmap escaped errors (#89585)
srossross Dec 9, 2022
eb9abd7
Reland "Add heirachical module names to torchFX graph.node" (#90205)
alexmsettle Dec 9, 2022
761c737
Add back support for PYTORCH_TEST_WITH_MPS (#66)
DenisVieriu97 Jul 19, 2022
fa09cf9
Add error messages for int64 non-available ops (#80)
abhudev Aug 11, 2022
a96aef3
Return input in addcmul/div if value is zero (#84)
razarmehr Aug 11, 2022
c4161df
Update allow and block lists (#88)
abhudev Aug 17, 2022
c51e86e
Include scalar params in caching key (#94)
abhudev Aug 25, 2022
848ef6c
Skip unsupported tests / Add supported mps dtypes (#98)
DenisVieriu97 Aug 26, 2022
3baf2b8
Exclude split from tests (#101)
abhudev Sep 1, 2022
1d63d4a
Update blocklist (#106)
abhudev Sep 12, 2022
bd098fe
Implement as_strided using existing MPSGraph Shape APIs (#109)
chrisbbayley Sep 27, 2022
860b771
Handle empty input in layer norm (#123)
abhudev Sep 29, 2022
5fc5802
Fix slice from view tensors (#129)
DenisVieriu97 Oct 1, 2022
0105c9b
Cast dot inputs to int32 when needed (#121)
abhudev Oct 1, 2022
1b32ddc
Raise error for dot int64 input (#130)
abhudev Oct 3, 2022
bf73428
Add MPSHooks interface to enable accessing specific MPS functions glo…
razarmehr Oct 4, 2022
bc3ff0a
Softplus and einsum are crashing.
kulinseth Oct 5, 2022
0f112d4
Add MPSGenerator to enable custom random number generators on MPS bac…
razarmehr Oct 5, 2022
b294ea8
Register unfold key for MPS (#134)
DenisVieriu97 Oct 11, 2022
0ec4818
Fix the base shape of view tensor in as_strided_mps (#135)
razarmehr Oct 13, 2022
7e2dfa8
Fix the assert in Garbage Collector (#139)
razarmehr Oct 14, 2022
f37bd4a
Exclude long dtype from reduction ops (min/max) (#138)
DenisVieriu97 Oct 17, 2022
3752d15
Replace the explicit commit in View ops with adaptive commit (#136)
razarmehr Oct 17, 2022
02b98f5
[MPS] Build fix due to MPSGenerator changes in multinomial
kulinseth Oct 14, 2022
5829158
Don't reset the Graph state. (#141)
kulinseth Oct 18, 2022
60e1262
Add suport for casting updatesTensor to match outputTensor dtype dire…
DenisVieriu97 Oct 19, 2022
3006e90
Add bincount support for mps (#137)
DenisVieriu97 Oct 19, 2022
18f7e82
Remove unused variable (build failure with WERROR=1) (#143)
DenisVieriu97 Oct 19, 2022
9f826ff
Disable the padding dimension checks for constant_pad (#144)
razarmehr Oct 21, 2022
01e84f1
Add support for copying cpu tensors into strided mps tensors (#142)
DenisVieriu97 Oct 24, 2022
73ef70b
Register norm_dtype_out for mps (#149)
DenisVieriu97 Oct 26, 2022
58e3a66
Improve the performance of torch.linear() (#148)
razarmehr Oct 26, 2022
6e9bb6d
Fix boolean casting in Unary ops (#150)
razarmehr Oct 26, 2022
ab43946
Fix data type issues with log1p() op (#151)
razarmehr Oct 26, 2022
cd8f317
Fix data type issues with sigmoid() op (#152)
razarmehr Oct 27, 2022
0ccd72e
Add check for macos 13.0 (#153)
DenisVieriu97 Oct 27, 2022
176051f
Fix gpu timeouts caused by bincount reduction large workload (#155)
DenisVieriu97 Oct 28, 2022
edb0870
Add nonzero support for mps (#133)
DenisVieriu97 Oct 28, 2022
30d6b04
Ignore cumulativeSumWithTensor missing selector warning on macos Mont…
DenisVieriu97 Oct 28, 2022
7deee6d
Add floor_divide() op and its test case (#157)
razarmehr Oct 28, 2022
a9408fc
Fix the type cast issue with Binary Ops (#158)
razarmehr Oct 31, 2022
c81a2c9
Skip gather/blit calls in case of strided output (#147)
DenisVieriu97 Oct 31, 2022
ffeac72
Solve contiguos view tensors using arrayViews instead of blits (#146)
DenisVieriu97 Nov 1, 2022
3e84439
Add support for torch.linalg.cross (#159)
DenisVieriu97 Nov 1, 2022
abf8edb
Add support for randperm (#160)
DenisVieriu97 Nov 2, 2022
ad91026
Add mps implementation for unique(no dim support) and unique_consecut…
chrisbbayley Nov 3, 2022
0b533ea
NLL loss TestConsistency seems to be working.
kulinseth Nov 4, 2022
733980f
Move several binary ops from block list to allow list (#165)
razarmehr Nov 8, 2022
a3e9ff5
Move passing tests to ALLOWLIST_OP (#168)
DenisVieriu97 Nov 8, 2022
3670890
Fix padding dimension issues with constant_pad_nd (#169)
razarmehr Nov 8, 2022
66c199c
Dev/skotapati/inverse (#167)
skotapati Nov 9, 2022
507cb59
Fix gather for uint8 data type (index_select) (#170)
DenisVieriu97 Nov 9, 2022
5cfba91
Add support for cdist on mps (#164)
DenisVieriu97 Nov 9, 2022
7622ed2
Fix MPSGraph casting issue to MPSDataTypeBool in masked_fill op (#166)
DenisVieriu97 Nov 9, 2022
64fe14b
Move some working tests from Block list to Allow list (#171)
razarmehr Nov 9, 2022
aaf9078
Unregister put() for MPS backend (#172)
razarmehr Nov 10, 2022
5040edf
Cast to unsigned type when comparing signed vs. unsigned integers in …
razarmehr Nov 11, 2022
bced1f1
Fix the skipIfMps include in header. (#175)
kulinseth Nov 11, 2022
94d3f2d
Fix the cast and shape issues with Scatter and Gather ops (#174)
razarmehr Nov 15, 2022
9d1075f
Workaround for the casting issue from floating point to uint8 (#176)
razarmehr Nov 15, 2022
f485193
Second tensor argument of tensor_split() must be on CPU (#177)
razarmehr Nov 16, 2022
599a439
Fix the Channels last bug with GradientWithInput. (#179)
kulinseth Nov 17, 2022
ec925fb
Fix cast issue in scatter() with uint8 type (#178)
razarmehr Nov 18, 2022
3887415
Fallback on gather code to solve view tensors when a slice is followe…
DenisVieriu97 Nov 19, 2022
d1f4873
fix huberloss for float16 (#185)
ssaladis Nov 30, 2022
8e32de4
Fix GPU timeout caused by negative padding in backward pass (#184)
razarmehr Dec 1, 2022
d84cf3d
Fix std and var for float16 and float32 (#186)
Ronian526 Dec 1, 2022
6ccf197
Fix correctness issues with Upsample 1D and 2D (#183)
razarmehr Dec 6, 2022
36658d2
Dev/denis/reduce ops multi axes support (#188)
DenisVieriu97 Dec 7, 2022
e864e8c
Convert grad output always to NCHW in mps_convolution_backward_input …
DenisVieriu97 Dec 8, 2022
e26a5f5
Add more testcases for convolution backward input pass (#191)
DenisVieriu97 Dec 8, 2022
fadc767
Dev/kulin/nll (#189)
kulinseth Dec 8, 2022
73b53ca
Fix conv grad weights (#192)
DenisVieriu97 Dec 8, 2022
9c1bdc9
implement sgn_out for mps
Berzeg Sep 9, 2022
3071065
sgn_out actually works, mostly, on mps
Berzeg Sep 14, 2022
b6a32a9
Reduce running time for cdist tests (#194)
DenisVieriu97 Dec 8, 2022
699ac43
Clamp argmin / argmax min indicies values to 0 (#193)
DenisVieriu97 Dec 8, 2022
cff41d8
Implement nan_to_num() for MPS backend (#196)
razarmehr Dec 9, 2022
7f42100
Build fixes.
kulinseth Dec 10, 2022
0fa37fc
Fix correctness issues with Pooling ops (#187)
razarmehr Dec 9, 2022
499ae54
Removed masked_var & masked_std from blocklist (#195)
skotapati Dec 9, 2022
9c46f46
Move passing tests from blocklist to allowlist; arrange failing tests…
DenisVieriu97 Dec 10, 2022
e09f561
Ventura build fix (#200)
DenisVieriu97 Dec 12, 2022
6ad73ce
Dev/kulin/einsum (#201)
kulinseth Dec 12, 2022
6ac05df
Fix masked_fill and masked_mean hangs on macOS Monterey (#202)
DenisVieriu97 Dec 13, 2022
9eb018a
Remove macOS13 duplicate check (#204)
DenisVieriu97 Dec 14, 2022
2fb4554
Workaround for the right padding bug in Monterey (#206)
razarmehr Dec 14, 2022
17cd9c0
Fix expand pattern when strides of src/dst are not matching (#203)
DenisVieriu97 Dec 14, 2022
d3c4c35
Enable high watermark ratio to limit the memory allocations (#197)
razarmehr Dec 14, 2022
990c3a2
Fix mps constant pad (#89864)
qqaatw Dec 13, 2022
314d316
Inverse op fallback (op/inverse) (#207)
DenisVieriu97 Dec 14, 2022
fd1139e
Display error for int64 pooling inputs on Monterey (#208)
razarmehr Dec 15, 2022
92ec9a6
Move as_strided_scatter to random output blocklist (#198)
Ronian526 Dec 15, 2022
21833f2
Fix the crash in sgn_out_mps() (#209)
razarmehr Dec 15, 2022
807e192
Fix the crash in max_out_mps() caused by cached key conflict (#211)
razarmehr Dec 15, 2022
a24e1a1
Replace assert() with torch_check() in reduce ops (#210)
razarmehr Dec 15, 2022
334a1e2
Fix ops with bool issues in macOS Monterey (#205)
DenisVieriu97 Dec 16, 2022
fd50f46
Fix the ChannelsLast memory format in cat_out_mps() (#212)
razarmehr Dec 16, 2022
60a3399
Add rtol/atol to the assertEqual() in gradient results check (#213)
razarmehr Dec 16, 2022
2e1dac7
Fix unrecognized selector build warnings in macOS Monterey (#215)
DenisVieriu97 Dec 18, 2022
c8f17a5
Fix randperm CPU fallback for macOS Monterey (#214)
DenisVieriu97 Dec 18, 2022
699a05c
Fix the error with high watermark value on x86 (#216)
razarmehr Dec 21, 2022
73d086c
Fix uint8 issues in BinaryOps for Monterey (#217)
razarmehr Dec 22, 2022
e6c405f
[MPS] Add MPSHooks interface to enable accessing MPS functions global…
razarmehr Dec 21, 2022
b2cf2c5
Remove unused pragma (#218)
DenisVieriu97 Jan 3, 2023
1d51d54
[MPS] Implement MPSGenerator to enable manual random seeding (#91348)
razarmehr Jan 3, 2023
bf4e7d9
Remove the unused code for view lists in OperationUtils.h (#219)
razarmehr Jan 3, 2023
d5e2e4e
Fix build failure (#220)
DenisVieriu97 Jan 3, 2023
7f4cf84
Changed conditions for handling view tensors in order to resolve test…
skotapati Jan 6, 2023
bcc9f63
Fix the crash in nan_to_num() with Float16 data type (#226)
razarmehr Jan 9, 2023
dabe321
Fix min/max_reduction_with_dim ops (#224)
DenisVieriu97 Jan 9, 2023
8c0bb00
Make intermediate type for cumsum ScalarType::Int (#221)
abhudev Jan 9, 2023
b323910
Add 2d grid sampler (#180)
DenisVieriu97 Jan 10, 2023
c071c85
Regenerate TestConsistency (#225)
DenisVieriu97 Jan 12, 2023
c87cf48
Remove test consistency duplicates (#231)
DenisVieriu97 Jan 13, 2023
96d6f97
Fix macos subversion check for grid sampler fallback (#230)
DenisVieriu97 Jan 13, 2023
f23cc91
Correctly apply weights to oneHotTensor in NLLLoss
Jan 14, 2023
559e7dd
Revert "Correctly apply weights to oneHotTensor in NLLLoss"
Jan 14, 2023
df8e01a
Correctly apply weights to oneHotTensor in NLLLoss (#233)
skotapati Jan 14, 2023
26371a4
Use the reshaped weights (#234)
DenisVieriu97 Jan 14, 2023
fb315e8
Add MPSAllocatorInterface to access methods of MPSAllocator (#227)
razarmehr Jan 17, 2023
f079740
Add triangular solve op through MPSMatrixSolveTriangular (#228)
DenisVieriu97 Jan 17, 2023
5d9be8c
Added zero check to inverse op, resolving crash seen in inverse & mat…
skotapati Jan 18, 2023
497300d
Remove nn.functional.conv_transpose2d from Blocklist (#232)
Ronian526 Jan 18, 2023
6dc2203
Fix slice followed by reshape (#237)
DenisVieriu97 Jan 18, 2023
d88dc8b
Revert "Added zero check to inverse op, resolving crash seen in inver…
kulinseth Jan 18, 2023
4a4d121
Remove square from block list (#229)
Ronian526 Jan 18, 2023
fd0b7ec
- fix batch_norm_mps_out key (#239)
Ronian526 Jan 19, 2023
e656d72
Fix index_add type issue (#235)
Ronian526 Jan 19, 2023
892bc60
- unlock _native_batch_norm_legit after resolving native_batch_norm (…
Ronian526 Jan 19, 2023
1d50f47
cherry-pick remainder op from upstream (#244)
DenisVieriu97 Jan 20, 2023
d24fe97
Fix arange_mps_out for empty tensor (#245)
DenisVieriu97 Jan 20, 2023
3a8490d
Fix nonzero for empty tensors (#248)
DenisVieriu97 Jan 20, 2023
cfb518c
Skip compilation of mps grid sampler for macos older than 13.1 (#247)
DenisVieriu97 Jan 20, 2023
d71061e
Fix relative paths in test_mps (#246)
DenisVieriu97 Jan 23, 2023
f81f2ac
Fix nn.functional.gelu (#249)
Ronian526 Jan 23, 2023
c506a9e
Add Python Module interface for MPS backend (#251)
razarmehr Jan 24, 2023
a6b4bc5
Enable MPS CI runners (#252)
DenisVieriu97 Jan 24, 2023
0408b26
Add additional checks for reshaped views (#250)
DenisVieriu97 Jan 25, 2023
940401f
Fix blocklist dict search (#255)
DenisVieriu97 Jan 26, 2023
a1e0d78
Fix the crash with hardswish_backward (#256)
razarmehr Jan 26, 2023
1eca176
Fix batch_norm_backward_mps (#257)
Ronian526 Jan 27, 2023
d86a428
Use low precision for FP16 for divtrunc_rounding (#263)
razarmehr Jan 27, 2023
ca427e8
Fix floor_divide (#243)
Ronian526 Jan 27, 2023
628cecd
Fix and unblock TestConsistency for median (#241)
Ronian526 Jan 27, 2023
95d95c1
Remove softplus from blocklist (#253)
Ronian526 Jan 27, 2023
0a17c0b
Calculate nonzero count inside nonzero op (#260)
DenisVieriu97 Jan 27, 2023
a82947b
Add support for negative dimensions in cumsum (#238)
abhudev Jan 28, 2023
a2af8c4
Fix test mps key search in blocklist dicts (#265)
DenisVieriu97 Jan 28, 2023
a88a612
Fix layer norm (#262)
Ronian526 Jan 30, 2023
d3d4b9d
Fix the crash in View ops when slicing wrong lengths (#267)
razarmehr Jan 30, 2023
4bea291
Adding blocklist for macOS 12 (#266)
Ronian526 Jan 31, 2023
8d873bb
Fix is_mps_available() regression on non-MPS devices (#268)
razarmehr Jan 31, 2023
11651e1
Add im2col and col2im to Fallback as they are mostly used in Preproce…
kulinseth Jan 31, 2023
42e7d51
Fix lintrunner (#271)
DenisVieriu97 Jan 31, 2023
d1b64eb
Update gradient allowlist and blocklist (#269)
DenisVieriu97 Jan 31, 2023
c00bcc7
- block conv_transpose3d, giving error messages (#272)
Ronian526 Feb 1, 2023
1fff5b5
cat with memory format
shuhand0 Feb 3, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 1 addition & 1 deletion .bazelrc
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
build --cxxopt=--std=c++14
build --cxxopt=--std=c++17
build --copt=-I.
# Bazel does not support including its cc_library targets as system
# headers. We work around this for generated code
Expand Down
468 changes: 468 additions & 0 deletions .circleci/README.md

Large diffs are not rendered by default.

98 changes: 15 additions & 83 deletions .circleci/docker/build.sh
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ function extract_all_from_image_name() {
if [ "x${name}" = xpy ]; then
vername=ANACONDA_PYTHON_VERSION
fi
# skip non-conforming fields such as "pytorch", "linux" or "xenial" without version string
# skip non-conforming fields such as "pytorch", "linux" or "bionic" without version string
if [ -n "${name}" ]; then
extract_version_from_image_name "${name}" "${vername}"
fi
Expand All @@ -46,11 +46,7 @@ if [[ "$image" == *xla* ]]; then
exit 0
fi

if [[ "$image" == *-xenial* ]]; then
UBUNTU_VERSION=16.04
elif [[ "$image" == *-artful* ]]; then
UBUNTU_VERSION=17.10
elif [[ "$image" == *-bionic* ]]; then
if [[ "$image" == *-bionic* ]]; then
UBUNTU_VERSION=18.04
elif [[ "$image" == *-focal* ]]; then
UBUNTU_VERSION=20.04
Expand Down Expand Up @@ -79,56 +75,17 @@ elif [[ "$image" == *rocm* ]]; then
DOCKERFILE="${OS}-rocm/Dockerfile"
fi

if [[ "$image" == *xenial* ]] || [[ "$image" == *bionic* ]]; then
CMAKE_VERSION=3.13.5
fi
# CMake 3.18 is needed to support CUDA17 language variant
CMAKE_VERSION=3.18.5

TRAVIS_DL_URL_PREFIX="https://s3.amazonaws.com/travis-python-archives/binaries/ubuntu/14.04/x86_64"
_UCX_COMMIT=31e74cac7bee0ef66bef2af72e7d86d9c282e5ab
_UCC_COMMIT=12944da33f911daf505d9bbc51411233d0ed85e1
_UCC_COMMIT=1c7a7127186e7836f73aafbd7697bbc274a77eee

# It's annoying to rename jobs every time you want to rewrite a
# configuration, so we hardcode everything here rather than do it
# from scratch
case "$image" in
pytorch-linux-xenial-py3.8)
ANACONDA_PYTHON_VERSION=3.8
GCC_VERSION=7
# Do not install PROTOBUF, DB, and VISION as a test
;;
pytorch-linux-xenial-py3.7-gcc7.2)
ANACONDA_PYTHON_VERSION=3.7
GCC_VERSION=7
# Do not install PROTOBUF, DB, and VISION as a test
;;
pytorch-linux-xenial-py3.7-gcc7)
ANACONDA_PYTHON_VERSION=3.7
GCC_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
;;
pytorch-linux-xenial-cuda10.2-cudnn7-py3-gcc7)
CUDA_VERSION=10.2
CUDNN_VERSION=7
ANACONDA_PYTHON_VERSION=3.7
GCC_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
KATEX=yes
;;
pytorch-linux-xenial-cuda11.3-cudnn8-py3-gcc7)
CUDA_VERSION=11.3.0 # Deviating from major.minor to conform to nvidia's Docker image names
CUDNN_VERSION=8
TENSORRT_VERSION=8.0.1.6
ANACONDA_PYTHON_VERSION=3.7
GCC_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
KATEX=yes
;;
pytorch-linux-bionic-cuda11.3-cudnn8-py3-clang9)
CUDA_VERSION=11.3.0 # Deviating from major.minor to conform to nvidia's Docker image names
CUDNN_VERSION=8
Expand Down Expand Up @@ -167,20 +124,6 @@ case "$image" in
UCC_COMMIT=${_UCC_COMMIT}
CONDA_CMAKE=yes
;;
pytorch-linux-xenial-py3-clang5-asan)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=5.0
PROTOBUF=yes
DB=yes
VISION=yes
;;
pytorch-linux-xenial-py3-clang7-asan)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
;;
pytorch-linux-focal-py3-clang7-asan)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=7
Expand All @@ -189,13 +132,6 @@ case "$image" in
VISION=yes
CONDA_CMAKE=yes
;;
pytorch-linux-xenial-py3-clang7-onnx)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
;;
pytorch-linux-focal-py3-clang10-onnx)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=10
Expand All @@ -204,23 +140,16 @@ case "$image" in
VISION=yes
CONDA_CMAKE=yes
;;
pytorch-linux-xenial-py3-clang5-android-ndk-r19c)
pytorch-linux-focal-py3-clang7-android-ndk-r19c)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=5.0
CLANG_VERSION=7
LLVMDEV=yes
PROTOBUF=yes
ANDROID=yes
ANDROID_NDK_VERSION=r19c
GRADLE_VERSION=6.8.3
NINJA_VERSION=1.9.0
;;
pytorch-linux-xenial-py3.7-clang7)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=7
PROTOBUF=yes
DB=yes
VISION=yes
;;
pytorch-linux-bionic-py3.7-clang9)
ANACONDA_PYTHON_VERSION=3.7
CLANG_VERSION=9
Expand Down Expand Up @@ -259,17 +188,17 @@ case "$image" in
VISION=yes
CONDA_CMAKE=yes
;;
pytorch-linux-focal-rocm5.1-py3.7)
ANACONDA_PYTHON_VERSION=3.7
pytorch-linux-focal-rocm5.1-py3.8)
ANACONDA_PYTHON_VERSION=3.8
GCC_VERSION=9
PROTOBUF=yes
DB=yes
VISION=yes
ROCM_VERSION=5.1.1
CONDA_CMAKE=yes
;;
pytorch-linux-focal-rocm5.2-py3.7)
ANACONDA_PYTHON_VERSION=3.7
pytorch-linux-focal-rocm5.2-py3.8)
ANACONDA_PYTHON_VERSION=3.8
GCC_VERSION=9
PROTOBUF=yes
DB=yes
Expand All @@ -279,7 +208,6 @@ case "$image" in
;;
pytorch-linux-focal-py3.7-gcc7)
ANACONDA_PYTHON_VERSION=3.7
CMAKE_VERSION=3.16.9 # Required for precompiled header support
GCC_VERSION=7
PROTOBUF=yes
DB=yes
Expand Down Expand Up @@ -320,6 +248,10 @@ case "$image" in
fi
if [[ "$image" == *rocm* ]]; then
extract_version_from_image_name rocm ROCM_VERSION
NINJA_VERSION=1.9.0
fi
if [[ "$image" == *centos7* ]]; then
NINJA_VERSION=1.10.2
fi
if [[ "$image" == *gcc* ]]; then
extract_version_from_image_name gcc GCC_VERSION
Expand Down
9 changes: 7 additions & 2 deletions .circleci/docker/common/install_base.sh
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,10 @@ install_ubuntu() {
sudo \
vim \
jq \
libtool
libtool \
vim \
unzip \
gdb

# Should resolve issues related to various apt package repository cert issues
# see: https://github.com/pytorch/pytorch/issues/65931
Expand Down Expand Up @@ -126,7 +129,9 @@ install_centos() {
opencv-devel \
sudo \
wget \
vim
vim \
unzip \
gdb

# Cleanup
yum clean all
Expand Down
3 changes: 0 additions & 3 deletions .circleci/docker/common/install_conda.sh
Original file line number Diff line number Diff line change
Expand Up @@ -104,9 +104,6 @@ if [ -n "$ANACONDA_PYTHON_VERSION" ]; then
conda_install magma-cuda$(TMP=${CUDA_VERSION/./};echo ${TMP%.*[0-9]}) -c pytorch
fi

# TODO: This isn't working atm
conda_install nnpack -c killeent

# Install some other packages, including those needed for Python test reporting
pip_install -r /opt/conda/requirements-ci.txt

Expand Down
4 changes: 2 additions & 2 deletions .circleci/docker/common/install_cudnn.sh
Original file line number Diff line number Diff line change
Expand Up @@ -6,9 +6,9 @@ if [[ ${CUDNN_VERSION} == 8 ]]; then
CUDNN_NAME="cudnn-linux-x86_64-8.3.2.44_cuda11.5-archive"
if [[ ${CUDA_VERSION:0:4} == "11.7" ]]; then
CUDNN_NAME="cudnn-linux-x86_64-8.5.0.96_cuda11-archive"
curl -OLs https://ossci-linux.s3.amazonaws.com/${CUDNN_NAME}.tar.xz
curl --retry 3 -OLs https://ossci-linux.s3.amazonaws.com/${CUDNN_NAME}.tar.xz
else
curl -OLs https://developer.download.nvidia.com/compute/redist/cudnn/v8.3.2/local_installers/11.5/${CUDNN_NAME}.tar.xz
curl --retry 3 -OLs https://developer.download.nvidia.com/compute/redist/cudnn/v8.3.2/local_installers/11.5/${CUDNN_NAME}.tar.xz
fi

tar xf ${CUDNN_NAME}.tar.xz
Expand Down
4 changes: 2 additions & 2 deletions .circleci/docker/common/install_docs_reqs.sh
Original file line number Diff line number Diff line change
Expand Up @@ -7,10 +7,10 @@ if [ -n "$KATEX" ]; then
# Ignore error if gpg-agent doesn't exist (for Ubuntu 16.04)
apt-get install -y gpg-agent || :

curl -sL https://deb.nodesource.com/setup_12.x | sudo -E bash -
curl --retry 3 -sL https://deb.nodesource.com/setup_12.x | sudo -E bash -
sudo apt-get install -y nodejs

curl -sS https://dl.yarnpkg.com/debian/pubkey.gpg | sudo apt-key add -
curl --retry 3 -sS https://dl.yarnpkg.com/debian/pubkey.gpg | sudo apt-key add -
echo "deb https://dl.yarnpkg.com/debian/ stable main" | sudo tee /etc/apt/sources.list.d/yarn.list

apt-get update
Expand Down
2 changes: 1 addition & 1 deletion .circleci/docker/common/install_protobuf.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ install_protobuf_317() {
# g++: error: ./../lib64/crti.o: No such file or directory
ln -s /usr/lib64 "$pb_dir/lib64"

curl -LO "https://github.com/protocolbuffers/protobuf/releases/download/v3.17.3/protobuf-all-3.17.3.tar.gz"
curl -LO "https://github.com/protocolbuffers/protobuf/releases/download/v3.17.3/protobuf-all-3.17.3.tar.gz" --retry 3
tar -xvz -C "$pb_dir" --strip-components 1 -f protobuf-all-3.17.3.tar.gz
# -j6 to balance memory usage and speed.
# naked `-j` seems to use too much memory.
Expand Down
22 changes: 20 additions & 2 deletions .circleci/docker/common/install_rocm.sh
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,12 @@ install_ubuntu() {
if [[ $(ver $ROCM_VERSION) -ge $(ver 4.5) ]]; then
# Add amdgpu repository
UBUNTU_VERSION_NAME=`cat /etc/os-release | grep UBUNTU_CODENAME | awk -F= '{print $2}'`
local amdgpu_baseurl="https://repo.radeon.com/amdgpu/${AMDGPU_VERSIONS[$ROCM_VERSION]}/ubuntu"
local amdgpu_baseurl
if [[ $(ver $ROCM_VERSION) -ge $(ver 5.3) ]]; then
amdgpu_baseurl="https://repo.radeon.com/amdgpu/${ROCM_VERSION}/ubuntu"
else
amdgpu_baseurl="https://repo.radeon.com/amdgpu/${AMDGPU_VERSIONS[$ROCM_VERSION]}/ubuntu"
fi
echo "deb [arch=amd64] ${amdgpu_baseurl} ${UBUNTU_VERSION_NAME} main" > /etc/apt/sources.list.d/amdgpu.list
fi

Expand All @@ -38,6 +43,10 @@ install_ubuntu() {
ROCM_REPO="xenial"
fi

if [[ $(ver $ROCM_VERSION) -ge $(ver 5.3) ]]; then
ROCM_REPO="${UBUNTU_VERSION_NAME}"
fi

# Add rocm repository
wget -qO - http://repo.radeon.com/rocm/rocm.gpg.key | apt-key add -
local rocm_baseurl="http://repo.radeon.com/rocm/apt/${ROCM_VERSION}"
Expand Down Expand Up @@ -78,7 +87,16 @@ install_centos() {

if [[ $(ver $ROCM_VERSION) -ge $(ver 4.5) ]]; then
# Add amdgpu repository
local amdgpu_baseurl="https://repo.radeon.com/amdgpu/${AMDGPU_VERSIONS[$ROCM_VERSION]}/rhel/7.9/main/x86_64"
local amdgpu_baseurl
if [[ $OS_VERSION == 9 ]]; then
amdgpu_baseurl="https://repo.radeon.com/amdgpu/${AMDGPU_VERSIONS[$ROCM_VERSION]}/rhel/9.0/main/x86_64"
else
if [[ $(ver $ROCM_VERSION) -ge $(ver 5.3) ]]; then
amdgpu_baseurl="https://repo.radeon.com/amdgpu/${ROCM_VERSION}/rhel/7.9/main/x86_64"
else
amdgpu_baseurl="https://repo.radeon.com/amdgpu/${AMDGPU_VERSIONS[$ROCM_VERSION]}/rhel/7.9/main/x86_64"
fi
fi
echo "[AMDGPU]" > /etc/yum.repos.d/amdgpu.repo
echo "name=AMDGPU" >> /etc/yum.repos.d/amdgpu.repo
echo "baseurl=${amdgpu_baseurl}" >> /etc/yum.repos.d/amdgpu.repo
Expand Down
7 changes: 6 additions & 1 deletion .circleci/docker/requirements-ci.txt
Original file line number Diff line number Diff line change
Expand Up @@ -159,8 +159,13 @@ pytest-shard
#Pinned versions:
#test that import:

pytest-flakefinder==1.1.0
#Description: plugin for rerunning tests a fixed number of times in pytest
#Pinned versions: 1.1.0
#test that import:

pytest-rerunfailures
#Description: plugin for rerunning tests in pytest
#Description: plugin for rerunning failure tests in pytest
#Pinned versions:
#test that import:

Expand Down
4 changes: 2 additions & 2 deletions .circleci/scripts/binary_install_miniconda.sh
Original file line number Diff line number Diff line change
Expand Up @@ -31,9 +31,9 @@ fi

conda_sh="$workdir/install_miniconda.sh"
if [[ "$(uname)" == Darwin ]]; then
curl --retry 3 -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
curl --retry 3 --retry-all-errors -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
else
curl --retry 3 -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
curl --retry 3 --retry-all-errors -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
fi
chmod +x "$conda_sh"
"$conda_sh" -b -p "$MINICONDA_ROOT"
Expand Down
2 changes: 1 addition & 1 deletion .circleci/scripts/binary_ios_upload.sh
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ fi
cp ${PROJ_ROOT}/LICENSE ${ZIP_DIR}/
# zip the library
export DATE="$(date -u +%Y%m%d)"
export IOS_NIGHTLY_BUILD_VERSION="1.14.0.${DATE}"
export IOS_NIGHTLY_BUILD_VERSION="2.0.0.${DATE}"
if [ "${BUILD_LITE_INTERPRETER}" == "1" ]; then
# libtorch_lite_ios_nightly_1.11.0.20210810.zip
ZIPFILE="libtorch_lite_ios_nightly_${IOS_NIGHTLY_BUILD_VERSION}.zip"
Expand Down
2 changes: 1 addition & 1 deletion .circleci/scripts/binary_linux_test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -98,7 +98,7 @@ if [[ "$PACKAGE_TYPE" == conda ]]; then
conda install \${EXTRA_CONDA_FLAGS} -y "\$pkg" --offline
)
elif [[ "$PACKAGE_TYPE" != libtorch ]]; then
pip install "\$pkg"
pip install "\$pkg" --extra-index-url "https://download.pytorch.org/whl/nightly/${DESIRED_CUDA}"
retry pip install -q future numpy protobuf typing-extensions six
fi
if [[ "$PACKAGE_TYPE" == libtorch ]]; then
Expand Down
2 changes: 1 addition & 1 deletion .circleci/scripts/binary_populate_env.sh
Original file line number Diff line number Diff line change
Expand Up @@ -59,7 +59,7 @@ PIP_UPLOAD_FOLDER='nightly/'
# We put this here so that OVERRIDE_PACKAGE_VERSION below can read from it
export DATE="$(date -u +%Y%m%d)"
#TODO: We should be pulling semver version from the base version.txt
BASE_BUILD_VERSION="1.14.0.dev$DATE"
BASE_BUILD_VERSION="2.0.0.dev$DATE"
# Change BASE_BUILD_VERSION to git tag when on a git tag
# Use 'git -C' to make doubly sure we're in the correct directory for checking
# the git tag
Expand Down
5 changes: 5 additions & 0 deletions .circleci/scripts/build_android_gradle.sh
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,11 @@ do
touch "$file" || true
done < <(find /var/lib/jenkins/.gradle -type f -print0)

# Patch pocketfft (as Android does not have aligned_alloc even if compiled with c++17
if [ -f ~/workspace/third_party/pocketfft/pocketfft_hdronly.h ]; then
sed -i -e "s/#if __cplusplus >= 201703L/#if 0/" ~/workspace/third_party/pocketfft/pocketfft_hdronly.h
fi

export GRADLE_LOCAL_PROPERTIES=~/workspace/android/local.properties
rm -f $GRADLE_LOCAL_PROPERTIES
echo "sdk.dir=/opt/android/sdk" >> $GRADLE_LOCAL_PROPERTIES
Expand Down
2 changes: 1 addition & 1 deletion .circleci/scripts/driver_update.bat
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
set "DRIVER_DOWNLOAD_LINK=https://s3.amazonaws.com/ossci-windows/452.39-data-center-tesla-desktop-win10-64bit-international.exe"
curl --retry 3 -kL %DRIVER_DOWNLOAD_LINK% --output 452.39-data-center-tesla-desktop-win10-64bit-international.exe
curl --retry 3 --retry-all-errors -kL %DRIVER_DOWNLOAD_LINK% --output 452.39-data-center-tesla-desktop-win10-64bit-international.exe
if errorlevel 1 exit /b 1

start /wait 452.39-data-center-tesla-desktop-win10-64bit-international.exe -s -noreboot
Expand Down
3 changes: 3 additions & 0 deletions .circleci/scripts/python_doc_push_script.sh
Original file line number Diff line number Diff line change
Expand Up @@ -135,6 +135,9 @@ git commit -m "Generate Python docs from pytorch/pytorch@${GITHUB_SHA}" || true
git status

if [[ "${WITH_PUSH:-}" == true ]]; then
# push to a temp branch first to trigger CLA check and satisfy branch protections
git push -u origin HEAD:pytorchbot/temp-branch-py -f
sleep 30
git push -u origin "${branch}"
fi

Expand Down
Loading