Issue with extracting datasets from unzipped files containing .gitkeep files

Hi everyone,

I just stumble in an issue while using the unzipping tool and was wondering if anyone else has experienced the same. I have a zipped folder containing some .gitkeep/.gitignore files, and I want to unzip it in Galaxy. While the unzipping is working correctly and a list of datasets is created, the Extract Dataset tool does not recognize this output.

Here is an example:

Any ideas?

Thanks in advance

Well, yes, the tool seems to ignore hidden files (starting with a .).

Wondering how much of a problem that is really. Why are you interested in these files in Galaxy?

Hi and thanks to confirm this @wm75 :slight_smile:

I’m working on a small project where I want to process zipped folders that MAY contain .gitkeep/.gitignore files. The issue isn’t that these files can’t be read (that would be ok), but rather that any other unzipped file in the datasets list become inaccessible one of these files is present.

Ah, ok, didn’t realize.

Have you tried the new zip explore feature then?

When you’re about to upload a zip archive that ends in .zip through the upload dialog you will nowadays see an explore option:

Currently, this allows you to upload select files as individual datasets only, but you can group them into collections from the history then.

Yep, this would be a nice solution for the problem and I looked into it. However, this would be a bit problematic if many files need to be selected or the folder contain several hundred files.

My idea was to build an automatic workflow for processing specific file present in this .zip files. The folder structure of the zipped files I am working with is highly standardized, so files that I want to process are always located at the same path. Considering this, it would be really easy to build an automatic workflow to filter just the files present in this folder without going manually trough it.

I am aware this can be too use-case specific. but I guess might be relevant if files are located in a git repository…?