How to delete orphan files after re-running migration?

When migrating files I’m copying them manually S3 bucket to bucket and then (since s3fs module is installed) I’m adding files to drupal file system with File::create() and the using "entity:file" plugin to save. But sometime I have different files with same file name so I’m adding version suffix to file name. I.e. img.jpg and then next will be img_1.jpg and then img_2.jpg and so on.

Now, if I create file and it gets version i.e. becomes img_5.jpg and then I roll back that migration file is deleted – all works well. But if I run migration once and I got img_5.jpg and the I run same migration again then I got img_6.jpg (since img_5.jpg name is occupied – file exists) then that img_5.jpg stays forever as orphan and will never be used or deleted. How to avoid that?

Since rollback deletes correct (but only last) file that means migration is aware of what file was used in previous migration. So I should before (or after) creating file with new version check for that remembered by last migration and delete it.

How can that be achieved? How can I know from processor plugin is migration going to create new object or update old?

How can I get file name of file created in previous migration so I can delete it?

This article was republished from its original source.
Call Us: 1(800)730-2416

Pixeldust is a 20-year-old web development agency specializing in Drupal and WordPress and working with clients all over the country. With our best in class capabilities, we work with small businesses and fortune 500 companies alike. Give us a call at 1(800)730-2416 and let’s talk about your project.

FREE Drupal SEO Audit

Test your site below to see which issues need to be fixed. We will fix them and optimize your Drupal site 100% for Google and Bing. (Allow 30-60 seconds to gather data.)

Powered by

How to delete orphan files after re-running migration?

On-Site Drupal SEO Master Setup

We make sure your site is 100% optimized (and stays that way) for the best SEO results.

With Pixeldust On-site (or On-page) SEO we make changes to your site’s structure and performance to make it easier for search engines to see and understand your site’s content. Search engines use algorithms to rank sites by degrees of relevance. Our on-site optimization ensures your site is configured to provide information in a way that meets Google and Bing standards for optimal indexing.

This service includes:

  • Pathauto install and configuration for SEO-friendly URLs.
  • Meta Tags install and configuration with dynamic tokens for meta titles and descriptions for all content types.
  • Install and fix all issues on the SEO checklist module.
  • Install and configure XML sitemap module and submit sitemaps.
  • Install and configure Google Analytics Module.
  • Install and configure Yoast.
  • Install and configure the Advanced Aggregation module to improve performance by minifying and merging CSS and JS.
  • Install and configure Schema.org Metatag.
  • Configure robots.txt.
  • Google Search Console setup snd configuration.
  • Find & Fix H1 tags.
  • Find and fix duplicate/missing meta descriptions.
  • Find and fix duplicate title tags.
  • Improve title, meta tags, and site descriptions.
  • Optimize images for better search engine optimization. Automate where possible.
  • Find and fix the missing alt and title tag for all images. Automate where possible.
  • The project takes 1 week to complete.