exec on skipped files #992

God-damnit-all · 2020-09-12T23:02:07Z

I'm interested in scraping Twitter with retweets, but I don't want to have to constantly redownload the same files from artists retweeting one another.

What I would like to do is, after each download, run a command for copying the file to a different directory structure (or creating a symbolic link). The only problem is exec only runs on files that are downloaded. If it skips a file, no command will be ran.

What I would like is an option to exec on skipped files, exec on downloaded files, or exec on both.

God-damnit-all · 2020-09-13T16:31:56Z

As an aside, if anyone can figure out how to do this with manually editing the code in the meantime, please let me know. Every tweak to the code I've tried has failed spectacularly.

mikf · 2020-09-13T17:45:14Z

For exec specifically, you can replace handle_skip() with the following and it should more or less work:

    def handle_skip(self):
        self.out.skip(self.pathfmt.path)
        if self.postprocessors:
            for pp in self.postprocessors:
                pp.run_after(self.pathfmt)
        if self._skipexc:
            self._skipcnt += 1
            if self._skipcnt >= self._skipmax:
                raise self._skipexc()

God-damnit-all · 2020-09-15T14:45:30Z

For exec specifically, you can replace handle_skip() with the following and it should more or less work:

    def handle_skip(self):
        self.out.skip(self.pathfmt.path)
        if self.postprocessors:
            for pp in self.postprocessors:
                pp.run_after(self.pathfmt)
        if self._skipexc:
            self._skipcnt += 1
            if self._skipcnt >= self._skipmax:
                raise self._skipexc()

I just realized I forgot to thank you for this. Thank you.

You should really set up the ability to receive donations through GitHub, I'd gladly send you a fiver every month.

mikf · 2020-11-25T11:34:13Z

9c3568c makes it is possible to use the following to run a command on skipped files:

{
    "name": "exec",
    "event": "skip",
    "command": "..."
}

To run it for both skipped and newly downloaded files, use "event": "file,skip"

mikf added the feature-request label Sep 13, 2020

mikf closed this as completed Nov 25, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

exec on skipped files #992

exec on skipped files #992

God-damnit-all commented Sep 12, 2020

God-damnit-all commented Sep 13, 2020

mikf commented Sep 13, 2020

God-damnit-all commented Sep 15, 2020

mikf commented Nov 25, 2020

exec on skipped files #992

exec on skipped files #992

Comments

God-damnit-all commented Sep 12, 2020

God-damnit-all commented Sep 13, 2020

mikf commented Sep 13, 2020

God-damnit-all commented Sep 15, 2020

mikf commented Nov 25, 2020