Content Size (self.size) remaining at 0 for certain zarr-related assets #2067

@aaronkanzer

Issue:

Some zarr assets are registered with a size of 0 even though their actual stored size is well above 0. This was noticed when running dandi download on Dandiset 000719. Still investigating whether other zarr-containing dandisets have similar issues.

Seems that there are 119 ZarrArchive objects registered with size of 0:

>>> ZarrArchive.objects.filter(size=0).count()
119

@waxlamp @satra @kabilar -- any idea whether this could be intentional behavior? The dandisets that contain these ZarrArchives of size 0 were posted to DANDI Archive at widely varying dates.

e.g. https://api.dandiarchive.org/api/dandisets/000108/versions/draft/assets/db2fe61f-2874-444d-b952-02234d00f2ba/ is from a few years ago and likewise reports a size of 0 for sub-SChmi53/ses-20220114h19m16s46/micr/sub-SChmi53_ses-20220114h19m16s46_sample-13_stain-LEC_run-1_chunk-3_SPIM.ome.zarr

Cc @yarikoptic @jwodder -- is this something that I should also cross-post in the dandi-cli repo for reference?

Steps to replicate the initially noticed bug

  1. Visit https://dandiarchive.org/dandiset/000719/draft
  2. Use the dandi download command: dandi download DANDI:000719/draft
  3. Observe the output (below is sample output from the middle of a download). Note especially lines such as:
    0 Bytes 31.0 MB 0% downloading 1256 done
(nov4) (base) aaronkanzer@Aarons-MacBook-Pro 000719 % dandi download DANDI:000719/draft          
PATH                                                    SIZE      DONE            DONE% CHECKSUM STATUS                 MESSAGE         
000719/dandiset.yaml                                                                             done                   updated         
...T210000_behavior+ophys_NestedDirectoryStore_nwb.zarr                                                                                 
...ses-1214621812_icephys_NestedDirectoryStore_nwb.zarr 0 Bytes   1.2 MB             0%          error                  AssertionError  
...ephys/sub-1214579789_ses-1214621812_icephys.nwb.zarr 0 Bytes   2.5 MB             0%          error                  AssertionError  
...ephys/sub-1214579789_ses-1214621812_icephys.nwb.zarr 25.8 MB   20.3 MB           78%          downloading            10469 done      
...ses-1214621812_icephys_NestedDirectoryStore_nwb.zarr 25.9 MB   20.4 MB           78%          downloading            10500 done      
...phys/sub-npI3_ses-20190421_behavior+ecephys.nwb.zarr 0 Bytes   184.7 MB           0%          downloading            741 done        
...90421_behavior+ecephys_NestedDirectoryStore_nwb.zarr 0 Bytes   179.8 MB           0%          downloading            720 done        
...s/sub-R6_ses-20200206T210000_behavior+ophys.nwb.zarr 0 Bytes   31.0 MB            0%          downloading            1256 done       
...79789_ses-1214621812_icephys_DirectoryStore.nwb.zarr                                                                                 
...ses-1214621812_icephys_NestedDirectoryStore_nwb.zarr                                                                                 
...200206T210000_behavior+ophys_DirectoryStore.nwb.zarr                                                                                 
...200206T210000_behavior+ophys_DirectoryStore.nwb.zarr       
  4. Verify the values stored in the Heroku Postgres database for the given zarr assets

Open a Django shell (python manage.py shell), then run:

from dandiapi.api.models import Dandiset

# .get() returns a single Dandiset; .filter() returns a QuerySet, which has no zarr_archives attribute
for archive in Dandiset.objects.get(pk="000719").zarr_archives.all():
    print(f"Name: {archive.name}, Size: {archive.size}")

Notice that some assets have a content size of 0

  5. Verify that a given zarr asset does have a size larger than 0 in AWS, for example:
aws s3 ls s3://dandiarchive/zarr/dbbf7b82-c649-409b-a1ae-3b28d1991628/ --recursive --human-readable --summarize
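
For anyone wanting to script the check in step 5, here is a minimal sketch that sums object sizes from saved `aws s3 ls --recursive` output and flags the case where the database says 0 but S3 holds data. It assumes the listing was produced without `--human-readable` (so sizes are plain byte counts). The helper names `total_listed_bytes` and `is_size_stale` are hypothetical, and the sample listing is made up for illustration, not real bucket output.

```python
import re

# One object line of `aws s3 ls --recursive` output (no --human-readable),
# e.g. "2024-10-01 12:34:56       1024 zarr/<id>/.zattrs".
# Summary lines such as "Total Objects: 2" do not match and are skipped.
_LISTING_LINE = re.compile(
    r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\s+(\d+)\s+(.+)$"
)


def total_listed_bytes(listing: str) -> int:
    """Sum the byte sizes of all object lines in an `aws s3 ls` listing."""
    total = 0
    for line in listing.splitlines():
        match = _LISTING_LINE.match(line.strip())
        if match:
            total += int(match.group(1))
    return total


def is_size_stale(registered_size: int, listing: str) -> bool:
    """True when the registered size is 0 but the listing contains data."""
    return registered_size == 0 and total_listed_bytes(listing) > 0


# Made-up listing for illustration only (hypothetical keys and sizes).
sample_listing = """\
2024-10-01 12:34:56       1024 zarr/example-id/.zattrs
2024-10-01 12:34:57    2097152 zarr/example-id/0/0/0

Total Objects: 2
   Total Size: 2098176
"""

if __name__ == "__main__":
    print(total_listed_bytes(sample_listing))
    print(is_size_stale(0, sample_listing))
```

The `registered_size` argument would be the `archive.size` value printed in step 4.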

The links below are the DANDI Archive API and UI pages for that asset:

https://api.dandiarchive.org/api/dandisets/000719/versions/draft/assets/23182373-62a6-4747-b6cd-ac7e37f0bb15/
https://dandiarchive.org/dandiset/000719/draft/files?location=ophys_DirectoryStore_9_29_24&page=1
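
In case it helps with triage, the suspicious rows in the download output from step 3 (SIZE of `0 Bytes` but a non-zero DONE column) can be picked out of a saved log with a small script. This is only a sketch: the column layout is inferred from the sample output above, the unit table assumes decimal units (kB, MB, ...), and `rows_with_zero_size` is a hypothetical helper, not part of dandi-cli.

```python
import re

# Assumed decimal unit multipliers for the SIZE and DONE columns.
_UNITS = {"Byte": 1, "Bytes": 1, "kB": 10**3, "MB": 10**6,
          "GB": 10**9, "TB": 10**12}

# A row with a path followed by SIZE and DONE values, e.g.
# "...foo.nwb.zarr 0 Bytes   1.2 MB   0%   error   AssertionError"
_ROW = re.compile(
    r"^(?P<path>\S+)\s+"
    r"(?P<size>\d+(?:\.\d+)?) (?P<size_unit>\w+)\s+"
    r"(?P<done>\d+(?:\.\d+)?) (?P<done_unit>\w+)(?:\s|$)"
)


def rows_with_zero_size(output: str) -> list[str]:
    """Return paths whose SIZE column is 0 while DONE is greater than 0."""
    flagged = []
    for line in output.splitlines():
        match = _ROW.match(line.strip())
        if not match:
            continue  # header, path-only rows, or rows without sizes
        if match["size_unit"] not in _UNITS or match["done_unit"] not in _UNITS:
            continue  # not a size we know how to interpret
        size = float(match["size"]) * _UNITS[match["size_unit"]]
        done = float(match["done"]) * _UNITS[match["done_unit"]]
        if size == 0 and done > 0:
            flagged.append(match["path"])
    return flagged


# Three rows copied from the sample output in this issue.
sample_output = """\
...ses-1214621812_icephys_NestedDirectoryStore_nwb.zarr 0 Bytes   1.2 MB             0%          error                  AssertionError
...ephys/sub-1214579789_ses-1214621812_icephys.nwb.zarr 25.8 MB   20.3 MB           78%          downloading            10469 done
...s/sub-R6_ses-20200206T210000_behavior+ophys.nwb.zarr 0 Bytes   31.0 MB            0%          downloading            1256 done
"""

if __name__ == "__main__":
    for path in rows_with_zero_size(sample_output):
        print(path)
```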
