Building an Automated Data Pipeline to Ingest Multi-Page PDF Documents from S3 and Process Them Using Textract, Lambda, and Step Functions