# Akeeba Amazon S3 Connector

A compact, dependency-less Amazon S3 API client implementing the most commonly used features.

## Why reinvent the wheel

After having a lot of impossible-to-debug problems with Amazon's Guzzle-based AWS SDK we decided to roll our own connector for Amazon S3. This is by no means a complete implementation, just a small subset of S3's features which are required by our software. The design goals are simplicity, no external dependencies and low memory footprint.

This code was originally based on [S3.php written by Donovan Schonknecht](http://undesigned.org.za/2007/10/22/amazon-s3-php-class) which is available under a BSD-like license. This repository no longer reflects the original author's work and should not be confused with it.

This software is distributed under the GNU General Public License version 3 or, at your option, any later version published by the Free Software Foundation (FSF). In short, it's "GPLv3+".

## Important note about version 2

Akeeba Amazon S3 Connector version 2 has dropped support for PHP 5.3 to 7.0 inclusive. It is only compatible with PHP 7.1 or later, up to and including PHP 8.0.

The most significant change in this version is that all methods use scalar type hints for parameters and return values. This _may_ break existing consumers which relied on implicit type conversion, e.g. passing strings containing integer values instead of _actual_ integer values.

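
If your code reads numeric settings from forms or configuration files, it may help to normalise them before calling the connector's typed methods. The following is a minimal sketch; `toStrictInt()` is a hypothetical helper and not part of the connector.

```php
/**
 * Hypothetical helper (not part of the connector): convert a loosely typed
 * value, e.g. one read from a form or an INI file, into a real integer.
 */
function toStrictInt($value): int
{
    if (is_int($value))
    {
        return $value;
    }

    // FILTER_VALIDATE_INT returns an integer on success, false otherwise
    $filtered = filter_var($value, FILTER_VALIDATE_INT);

    if ($filtered === false)
    {
        throw new InvalidArgumentException('Not an integer value: ' . var_export($value, true));
    }

    return $filtered;
}

// A lifetime in seconds that arrived as a string
$lifetime = toStrictInt('60');
```

The resulting `$lifetime` is an actual integer and can be passed wherever a method expects an `int` parameter, for example as the last argument of `getAuthenticatedURL()`.
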
## Using the connector

### Get a connector object

```php
$configuration = new \Akeeba\Engine\Postproc\Connector\S3v4\Configuration(
    'YourAmazonAccessKey',
    'YourAmazonSecretKey'
);

$connector = new \Akeeba\Engine\Postproc\Connector\S3v4\Connector($configuration);
```

If you are running inside an Amazon EC2 instance you can fetch temporary credentials from the instance's metadata
server using the IAM Role attached to the EC2 instance. In this case you need to do this (169.254.169.254 is a fixed
IP hosting the instance's metadata cache service):

```php
$role = file_get_contents('http://169.254.169.254/latest/meta-data/iam/security-credentials/');
$jsonCredentials = file_get_contents('http://169.254.169.254/latest/meta-data/iam/security-credentials/' . $role);
$credentials = json_decode($jsonCredentials, true);
$configuration = new \Akeeba\Engine\Postproc\Connector\S3v4\Configuration(
    $credentials['AccessKeyId'],
    $credentials['SecretAccessKey'],
    'v4',
    $yourRegion
);
$configuration->setToken($credentials['Token']);

$connector = new \Akeeba\Engine\Postproc\Connector\S3v4\Connector($configuration);
```

where `$yourRegion` is the AWS region of your bucket, e.g. `us-east-1`. Please note that we are passing the security
token (`$credentials['Token']`) to the Configuration object. This is REQUIRED. The temporary credentials returned by
the metadata service won't work without it.

Also worth noting is that the temporary credentials don't last forever. Check the `$credentials['Expiration']` to see
when they are about to expire. Amazon recommends that you retry fetching new credentials from the metadata service
10 minutes before your cached credentials are set to expire. The metadata service is guaranteed to provision fresh
temporary credentials by that time.

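
The expiry check described above could be sketched as follows. `credentialsNeedRefresh()` is a hypothetical helper, not part of the connector, and it assumes that `Expiration` is a date string which `strtotime()` can parse. The timestamps are made up for illustration.

```php
/**
 * Hypothetical helper (not part of the connector): report whether cached
 * temporary credentials should be refreshed.
 *
 * @param array    $credentials Decoded metadata response; only 'Expiration' is read
 * @param int|null $now         Current UNIX timestamp (defaults to time())
 * @param int      $margin      Refresh this many seconds before expiry (600 = 10 minutes)
 */
function credentialsNeedRefresh(array $credentials, ?int $now = null, int $margin = 600): bool
{
    $expiration = isset($credentials['Expiration']) ? strtotime($credentials['Expiration']) : false;

    // Missing or unparseable expiration: play it safe and refresh
    if ($expiration === false)
    {
        return true;
    }

    $now = $now ?? time();

    return ($expiration - $now) <= $margin;
}

// Made-up example: credentials expiring at 12:00:00 UTC
$credentials     = ['Expiration' => '2025-01-10T12:00:00Z'];
$elevenThirty    = strtotime('2025-01-10T11:30:00Z');
$elevenFiftyFive = strtotime('2025-01-10T11:55:00Z');

$refreshEarly = credentialsNeedRefresh($credentials, $elevenThirty);    // 30 minutes left
$refreshLate  = credentialsNeedRefresh($credentials, $elevenFiftyFive); // 5 minutes left
```

When the helper returns `true`, fetch the credentials from the metadata service again and build a new Configuration object, including the `setToken()` call.
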
### Listing buckets

```php
$listing = $connector->listBuckets(true);
```

Returns an array like this:

```
array(2) {
  'owner' =>
  array(2) {
    'id' =>
    string(64) "0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef"
    'name' =>
    string(12) "someUserName"
  }
  'buckets' =>
  array(3) {
    [0] =>
    array(2) {
      'name' =>
      string(8) "mybucket"
      'time' =>
      int(1267730711)
    }
    [1] =>
    array(2) {
      'name' =>
      string(13) "anotherbucket"
      'time' =>
      int(1269516249)
    }
    [2] =>
    array(2) {
      'name' =>
      string(15) "differentbucket"
      'time' =>
      int(1354458048)
    }
  }
}
```

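
As a rough illustration of how such a listing can be consumed, the sketch below loops over a hand-written array shaped like the dump above. It assumes that `time` is a UNIX timestamp, which the integer values suggest. The sample data stands in for the real return value of `listBuckets()`.

```php
// Sample data shaped like the dump above (stands in for a real listBuckets() result)
$listing = [
    'owner'   => [
        'id'   => str_repeat('0123456789abcdef', 4),
        'name' => 'someUserName',
    ],
    'buckets' => [
        ['name' => 'mybucket', 'time' => 1267730711],
        ['name' => 'anotherbucket', 'time' => 1269516249],
        ['name' => 'differentbucket', 'time' => 1354458048],
    ],
];

$lines = [];

foreach ($listing['buckets'] as $bucket)
{
    // Assumption: 'time' is a UNIX timestamp; gmdate() formats it in UTC
    $lines[] = sprintf('%-16s %s', $bucket['name'], gmdate('Y-m-d', $bucket['time']));
}

echo implode(PHP_EOL, $lines), PHP_EOL;
```
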
### Listing bucket contents

```php
$listing = $connector->getBucket('mybucket', 'path/to/list/');
```

If you want to list "subdirectories" you need to do:

```php
$listing = $connector->getBucket('mybucket', 'path/to/list/', null, null, '/', true);
```

The last parameter (common prefixes) controls the listing of "subdirectories".

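
To make the idea of common prefixes more concrete, here is a small sketch that emulates the grouping locally: every key under the requested prefix that contains another delimiter is rolled up into a "subdirectory". `commonPrefixes()` is a hypothetical function written for illustration only. It is not how the connector works internally, and it says nothing about the exact structure `getBucket()` returns.

```php
/**
 * Illustrative only: emulate how keys are grouped into "common prefixes".
 * This is NOT the connector's implementation.
 */
function commonPrefixes(array $keys, string $prefix, string $delimiter = '/'): array
{
    $prefixes = [];

    foreach ($keys as $key)
    {
        // Ignore keys outside the requested prefix
        if ($prefix !== '' && strpos($key, $prefix) !== 0)
        {
            continue;
        }

        // Look for the next delimiter after the prefix
        $position = strpos($key, $delimiter, strlen($prefix));

        if ($position !== false)
        {
            // Everything up to and including the delimiter is a "subdirectory"
            $prefixes[substr($key, 0, $position + strlen($delimiter))] = true;
        }
    }

    return array_keys($prefixes);
}

$keys = [
    'path/to/list/readme.txt',
    'path/to/list/images/a.jpg',
    'path/to/list/images/b.jpg',
    'path/to/list/docs/manual.pdf',
    'path/to/other/file.txt',
];

$subdirectories = commonPrefixes($keys, 'path/to/list/');
```

Here `$subdirectories` contains `path/to/list/images/` and `path/to/list/docs/`, while `readme.txt` is a plain object directly under the prefix.
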
### Uploading (small) files

From a file:

```php
$input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromFile($sourceFile);
$connector->putObject($input, 'mybucket', 'path/to/myfile.txt');
```

From a string:

```php
$input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromData($sourceString);
$connector->putObject($input, 'mybucket', 'path/to/myfile.txt');
```

From a stream resource:

```php
$input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromResource($streamHandle, false);
$connector->putObject($input, 'mybucket', 'path/to/myfile.txt');
```

In all cases the entirety of the file has to be loaded in memory.

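
Because these calls load the whole file into memory, a caller may want to route larger files to the multipart flow instead. The following is a minimal sketch. Both `shouldUseMultipart()` and the 5 MB cut-off are assumptions made for this example and are not part of the connector.

```php
// Assumed cut-off in bytes (5 MB); pick whatever suits your memory limit
const MULTIPART_THRESHOLD = 5242880;

/**
 * Hypothetical helper (not part of the connector): decide whether a file is
 * large enough to warrant a multipart upload.
 */
function shouldUseMultipart(string $sourceFile, int $threshold = MULTIPART_THRESHOLD): bool
{
    if (!is_file($sourceFile))
    {
        throw new InvalidArgumentException("Not a regular file: $sourceFile");
    }

    return filesize($sourceFile) > $threshold;
}

// Example with a small temporary file (1 KB of filler data)
$sourceFile = tempnam(sys_get_temp_dir(), 's3');
file_put_contents($sourceFile, str_repeat('x', 1024));

$useMultipart = shouldUseMultipart($sourceFile);

unlink($sourceFile);
```
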
### Uploading large files with multipart (chunked) uploads

Files are uploaded in 5 MB chunks.

```php
$input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromFile($sourceFile);
$uploadId = $connector->startMultipart($input, 'mybucket', 'mypath/movie.mov');

$eTags = array();
$eTag = null;
$partNumber = 0;

do
{
    // IMPORTANT: You MUST create the input afresh before each uploadMultipart call
    $input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromFile($sourceFile);
    $input->setUploadID($uploadId);
    $input->setPartNumber(++$partNumber);

    $eTag = $connector->uploadMultipart($input, 'mybucket', 'mypath/movie.mov');

    if (!is_null($eTag))
    {
        $eTags[] = $eTag;
    }
}
while (!is_null($eTag));

// IMPORTANT: You MUST create the input afresh before finalising the multipart upload
$input = \Akeeba\Engine\Postproc\Connector\S3v4\Input::createFromFile($sourceFile);
$input->setUploadID($uploadId);
$input->setEtags($eTags);

$connector->finalizeMultipart($input, 'mybucket', 'mypath/movie.mov');
```

As long as you keep track of the UploadId, PartNumber and ETags you can have each uploadMultipart call in a separate
page load to prevent timeouts.

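
One way to keep track of that state between page loads is to store it in a small JSON file. The sketch below is only an illustration; `saveUploadState()`, `loadUploadState()` and the state file location are hypothetical and not part of the connector. A session or database row would work just as well.

```php
/**
 * Hypothetical helper (not part of the connector): persist the multipart
 * upload state as a JSON file.
 */
function saveUploadState(string $stateFile, string $uploadId, int $partNumber, array $eTags): void
{
    $state = [
        'uploadId'   => $uploadId,
        'partNumber' => $partNumber,
        'eTags'      => $eTags,
    ];

    file_put_contents($stateFile, json_encode($state), LOCK_EX);
}

/**
 * Hypothetical helper: load the saved state, or null when there is none.
 */
function loadUploadState(string $stateFile): ?array
{
    if (!is_file($stateFile))
    {
        return null;
    }

    $state = json_decode(file_get_contents($stateFile), true);

    return is_array($state) ? $state : null;
}

// Example round trip with made-up values
$stateFile = sys_get_temp_dir() . '/s3-multipart-state.json';

saveUploadState($stateFile, 'exampleUploadId', 3, ['etag1', 'etag2', 'etag3']);
$state = loadUploadState($stateFile);

unlink($stateFile);
```

On each page load you would load the state, create a fresh Input, call `setUploadID()` with the stored upload ID and `setPartNumber()` with the stored part number plus one, upload the part, append the returned ETag and save the state again.
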
### Get presigned URLs

Allows browsers to download files directly without exposing your credentials and without going through your server:

```php
$preSignedURL = $connector->getAuthenticatedURL('mybucket', 'path/to/file.jpg', 60);
```

The last parameter controls how many seconds into the future this URL will be valid.

### Download

To a file with absolute path `$targetFile`:

```php
$connector->getObject('mybucket', 'path/to/file.jpg', $targetFile);
```

To a string:

```php
$content = $connector->getObject('mybucket', 'path/to/file.jpg', false);
```

### Delete an object

```php
$connector->deleteObject('mybucket', 'path/to/file.jpg');
```

## Configuration options

The Configuration object has optional methods which can be used to enable some useful features in the connector.

You need to execute these methods against the Configuration object before passing it to the Connector's constructor. For example:

```php
$configuration = new \Akeeba\Engine\Postproc\Connector\S3v4\Configuration(
    'YourAmazonAccessKey',
    'YourAmazonSecretKey'
);

// Use v4 signatures and Dualstack URLs
$configuration->setSignatureMethod('v4');
$configuration->setUseDualstackUrl(true);

$connector = new \Akeeba\Engine\Postproc\Connector\S3v4\Connector($configuration);
```

### HTTPS vs plain HTTP

**It is not recommended to use plain HTTP connections to Amazon S3**. If, however, you have no other option you can tell the Configuration object to use plain HTTP URLs:

```php
$configuration->setSSL(false);
```

### Custom endpoint

You can use the Akeeba Amazon S3 Connector library with S3-compatible APIs such as DigitalOcean's Spaces by changing the endpoint URL.

Please note that if the S3-compatible API uses v4 signatures you need to enter the region-specific endpoint domain name and the region when initializing the object, e.g.:

```php
// DigitalOcean Spaces using v4 signatures
// The access credentials are those used in the example at https://developers.digitalocean.com/documentation/spaces/
$configuration = new \Akeeba\Engine\Postproc\Connector\S3v4\Configuration(
    '532SZONTQ6ALKBCU94OU',
    'zCkY83KVDXD8u83RouEYPKEm/dhPSPB45XsfnWj8fxQ',
    'v4',
    'nyc3'
);
$configuration->setEndpoint('nyc3.digitaloceanspaces.com');

$connector = new \Akeeba\Engine\Postproc\Connector\S3v4\Connector($configuration);
```

If your S3-compatible API uses v2 signatures you do not need to specify a region.

```php
// DigitalOcean Spaces using v2 signatures
// The access credentials are those used in the example at https://developers.digitalocean.com/documentation/spaces/
$configuration = new \Akeeba\Engine\Postproc\Connector\S3v4\Configuration(
    '532SZONTQ6ALKBCU94OU',
    'zCkY83KVDXD8u83RouEYPKEm/dhPSPB45XsfnWj8fxQ',
    'v2'
);
$configuration->setEndpoint('nyc3.digitaloceanspaces.com');

$connector = new \Akeeba\Engine\Postproc\Connector\S3v4\Connector($configuration);
```

### Legacy path-style access

The S3 API calls made by this library use subdomain-style access by default. That is to say, the endpoint will be prefixed with the name of the bucket. For example, a bucket called `example` in the `eu-west-1` region will be accessed using the endpoint URL `example.s3.eu-west-1.amazonaws.com`.

If you have buckets with characters that are invalid in the context of DNS (most notably dots and uppercase characters) this will fail. You will need to use the legacy path style instead. In this case the endpoint used is the generic region-specific one (`s3.eu-west-1.amazonaws.com` in our example above) and the API URL will be prefixed with the bucket name.

You need to do:

```php
$configuration->setUseLegacyPathStyle(true);
```

Caveat: this will not work with v2 signatures if you are using Amazon AWS S3 proper. It will work with the v2 signatures if you are using a custom endpoint, though. In fact, most S3-compatible APIs implementing v2 signatures _expect_ you to use path-style access.

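
If you handle bucket names you do not control, you could decide per bucket whether path-style access is needed. The sketch below follows the rule of thumb given above (dots and uppercase characters are a problem). `needsLegacyPathStyle()` is a hypothetical helper, not part of the connector, and the regular expression is a simplification.

```php
/**
 * Hypothetical helper (not part of the connector): decide whether a bucket
 * name is unsafe to use as a subdomain and therefore needs path-style access.
 */
function needsLegacyPathStyle(string $bucket): bool
{
    // DNS-safe here means: lowercase letters, digits and hyphens only,
    // starting and ending with a letter or digit, 3 to 63 characters long
    $isDnsSafe = preg_match('/^[a-z0-9][a-z0-9-]{1,61}[a-z0-9]$/', $bucket) === 1;

    return !$isDnsSafe;
}

$checks = [
    'example'   => needsLegacyPathStyle('example'),
    'my.bucket' => needsLegacyPathStyle('my.bucket'),
    'MyBucket'  => needsLegacyPathStyle('MyBucket'),
];
```

The result can be passed straight to the Configuration object, e.g. `$configuration->setUseLegacyPathStyle(needsLegacyPathStyle($bucket));`, keeping in mind the v2 signature caveat above.
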
### Dualstack (IPv4 and IPv6) support

Amazon S3 supports dual-stack URLs which resolve to both IPv4 and IPv6 addresses. By default they are _not_ used. If you want to enable this feature you need to do:

```php
$configuration->setUseDualstackUrl(true);
```

Caveat: this option only takes effect if you are using Amazon S3 proper. It will _not_ have any effect with custom endpoints.