minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	2b4eb87d77	pick disks which are common maximally used (#10600 ) further optimization to ensure that good disks are always used for listing, other than healing we only use disks that are maximally used.	4 years ago
Harshavardhana	00eb6f6bc9	cache DiskInfo at storage layer for performance (#10586 ) `mc admin info` on busy setups will not move HDD heads unnecessarily for repeated calls, provides a better responsiveness for the call overall. Bonus change allow listTolerancePerSet be N-1 for good entries, to avoid skipping entries for some reason one of the disk went offline.	4 years ago
Harshavardhana	66174692a2	add '.healing.bin' for tracking currently healing disk (#10573 ) add a hint on the disk to allow for tracking fresh disk being healed, to allow for restartable heals, and also use this as a way to track and remove disks. There are more pending changes where we should move all the disk formatting logic to backend drives, this PR doesn't deal with this refactor instead makes it easier to track healing in the future.	4 years ago
Harshavardhana	7f9498f43f	fix: ignore faulty drives and continue (#10511 ) drives might return different types of errors handle them individually, and for some errors just log an error and continue	4 years ago
Klaus Post	34859c6d4b	Preallocate (safe) slices when we know the size (#10459 )	4 years ago
Klaus Post	fa01e640f5	Continous healing: add optional bitrot check (#10417 )	4 years ago
Anis Elleuch	af88772a78	lifecycle: NoncurrentVersionExpiration considers noncurrent version age (#10444 ) From https://docs.aws.amazon.com/AmazonS3/latest/dev/intro-lifecycle-rules.html#intro-lifecycle-rules-actions ``` When specifying the number of days in the NoncurrentVersionTransition and NoncurrentVersionExpiration actions in a Lifecycle configuration, note the following: It is the number of days from when the version of the object becomes noncurrent (that is, when the object is overwritten or deleted), that Amazon S3 will perform the action on the specified object or objects. Amazon S3 calculates the time by adding the number of days specified in the rule to the time when the new successor version of the object is created and rounding the resulting time to the next day midnight UTC. For example, in your bucket, suppose that you have a current version of an object that was created at 1/1/2014 10:30 AM UTC. If the new version of the object that replaces the current version is created at 1/15/2014 10:30 AM UTC, and you specify 3 days in a transition rule, the transition date of the object is calculated as 1/19/2014 00:00 UTC. ```	4 years ago
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	4 years ago
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	4 years ago
Klaus Post	3e1fb17b70	heal: Check for truncated files (#10399 ) When checking parts we already do a stat for each part. Since we have the on disk size check if it is at least what we expect. When checking metadata check if metadata is 0 bytes.	4 years ago
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	4 years ago
Harshavardhana	d19b434ffc	fix: bring back delayed leaf detection in listing (#10346 )	4 years ago
Klaus Post	17a1eda702	Disregard healing disks in crawling (#10349 ) When crawling never use a disk we know is healing. Most of the change involves keeping track of the original endpoint on xlStorage and this also fixes DiskInfo.Endpoint never being populated. Heal master will print `data-crawl: Disk "http://localhost:9001/data/mindev/data2/xl1" is Healing, skipping` once on a cycle (no more often than every 5m).	4 years ago
Harshavardhana	74116204ce	handle fresh setup with mixed drives (#10273 ) fresh drive setups when one of the drive is a root drive, we should ignore such a root drive and not proceed to format. This PR handles this properly by marking the disks which are root disk and they are taken offline.	4 years ago
Harshavardhana	e4a44f6224	fix: commonPrefixes behavior in ListObjectVersions (#10286 ) ``` $ aws s3api --profile minio --endpoint-url http://localhost:9003 \ list-object-versions --bucket testbucket \ --delimiter / --prefix Veeam/Archive/ { "CommonPrefixes": [ { "Prefix": "Veeam/Archive/003/" } ] } ``` Also add coverage tests similar to ListObjects to catch errors in future, skip these tests in FS mode	4 years ago
Anis Elleuch	51ba1dac49	listing: Fix result when prefix is an object with a slash (#10267 ) In a non recursive mode, issuing a list request where prefix is an existing object with a slash and delimiter is a slash will return entries in the object directory (data dir IDs) ``` $ aws s3api --profile minioadmin --endpoint-url http://localhost:9000 \ list-objects-v2 --bucket testbucket --prefix code_of_conduct.md/ --delimiter '/' { "CommonPrefixes": [ { "Prefix": "code_of_conduct.md/ec750fe0-ea7e-4b87-bbec-1e32407e5e47/" } ] } ``` This commit adds a fast exit track in Walk() in this specific case.	4 years ago
Harshavardhana	6c6137b2e7	add cluster maintenance healthcheck drive heal affinity (#10218 )	4 years ago
Harshavardhana	a20d4568a2	fix: make sure to use uniform drive count calculation (#10208 ) It is possible in situations when server was deployed in asymmetric configuration in the past such as ``` minio server ~/fs{1...4}/disk{1...5} ``` Results in setDriveCount of 10 in older releases but with fairly recent releases we have moved to having server affinity which means that a set drive count ascertained from above config will be now '4' While the object layer make sure that we honor `format.json` the storageClass configuration however was by mistake was using the global value obtained by heuristics. Which leads to prematurely using lower parity without being requested by the an administrator. This PR fixes this behavior.	4 years ago
Harshavardhana	0b8255529a	fix: proxies set keep-alive timeouts to be system dependent (#10199 ) Split the DialContext's one for internode and another for all other external communications especially proxy forwarders, gateway transport etc.	4 years ago
Harshavardhana	019fe69a57	fix: reduce an extra system call for writes instead fail later (#10187 )	4 years ago
Harshavardhana	b16781846e	allow server to start even with corrupted/faulty disks (#10175 )	4 years ago
poornas	c43da3005a	Add support for server side bucket replication (#9882 )	4 years ago
Harshavardhana	a880283593	Send the lower level error directly from GetDiskID() (#10095 ) this is to detect situations of corruption disk format etc errors quickly and keep the disk online in such scenarios for requests to fail appropriately.	4 years ago
Harshavardhana	17747db93f	fix: support healing older content (#10076 ) This PR adds support for healing older content i.e from 2yrs, 1yr. Also handles other situations where our config was not encrypted yet. This PR also ensures that our Listing is consistent and quorum friendly, such that we don't list partial objects	4 years ago
Harshavardhana	e7d7d5232c	fix: admin info output and improve overall performance (#10015 ) - admin info node offline check is now quicker - admin info now doesn't duplicate the code across doing the same checks for disks - rely on StorageInfo to return appropriate errors instead of calling locally. - diskID checks now return proper errors when disk not found v/s format.json missing. - add more disk states for more clarity on the underlying disk errors.	4 years ago
Harshavardhana	1d65ef3201	fix: deletes on older format properly (#10029 ) while we handle all situations for writes and reads on older format, what we didn't cater for properly yet was delete where we only ended up deleting just `xl.meta` - instead we should allow all the deletes to go through for older format without versioning enabled buckets.	4 years ago
Harshavardhana	c0adb52213	sync to disk only upon successful legacy metadata rename (#10018 )	4 years ago
Anis Elleuch	d4af132fc4	lifecycle: Expiry should not delete versions (#9972 ) Currently, lifecycle expiry is deleting all object versions which is not correct, unless noncurrent versions field is specified. Also, only delete the delete marker if it is the only version of the given object.	4 years ago
Anis Elleuch	21a37e3393	fix: ListObjectVersions should return ordered Version & DeleteMarker (#9959 ) The S3 specification says that versions are ordered in the response of list object versions. mc snapshot needs this to know which version comes first especially when two versions have the same exact last-modified field.	4 years ago
Klaus Post	abd999f64a	fix: list object versions in distributed setup (#9958 ) Remove calls to `WalkVersions` was calling the wrong endpoint, so unless quorum could be reached with local disks no results would ever be returned.	4 years ago
Harshavardhana	174f428571	add additional fdatasync before close() on writes (#9947 )	4 years ago
Klaus Post	cae09d8b84	crawler: Wait max 1 second (#9894 ) Add 1-second timeout to crawler wait. This will make the crawler able to run, albeit very, very slowly on high load servers.	4 years ago
Klaus Post	972d876ca9	Do not select zones with <5% free after upload (#9877 ) Looking into full disk errors on zoned setup. We don't take the 5% space requirement into account when selecting a zone. The interesting part is that even considering this we don't know the size of the object the user wants to upload when they do multipart uploads. It seems quite defensive to always upload multiparts to the zone where there is the most space since all load will be directed to a part of the cluster. In these cases we make sure it can at least hold a 1GiB file and we disadvantage fuller zones more by subtracting the expected size before weighing.	5 years ago
Harshavardhana	9626a981bc	fix: Preserve old data appropriately (#9873 ) This PR fixes all the below scenarios and handles them correctly. - existing data/bucket is replaced with new content, no versioning enabled old structure vanishes. - existing data/bucket - enable versioning before uploading any data, once versioning enabled upload new content, old content is preserved. - suspend versioning on the bucket again, now upload content again the old content is purged since that is the default "null" version. Additionally sync data after xl.json -> xl.meta rename(), to avoid any surprises if there is a crash during this rename operation.	5 years ago
Harshavardhana	94424e14d7	fix: rename legacy xl.json to xl.meta properly in ListDir() (#9863 )	5 years ago
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	5 years ago
Klaus Post	43d6e3ae06	merge object lifecycle checks into usage crawler (#9579 )	5 years ago
Klaus Post	142b057be8	Check object names on windows (#9798 ) Uploading files with names that could not be written to disk would result in "reduce your request" errors returned. Instead check explicitly for disallowed characters and reject files with `Object name contains unsupported characters.`	5 years ago
Harshavardhana	b2db8123ec	Preserve errors returned by diskInfo to detect disk errors (#9727 ) This PR basically reverts #9720 and re-implements it differently	5 years ago
Harshavardhana	0c71ce3398	fix size accounting for encrypted/compressed objects (#9690 ) size calculation in crawler was using the real size of the object instead of its actual size i.e either a decrypted or uncompressed size. this is needed to make sure all other accounting such as bucket quota and mcs UI to display the correct values.	5 years ago
Anis Elleuch	9baeda781a	fix storage info output with unordered endpoints arguments (#9610 ) Shuffling arguments that we pass to MinIO server are supported. However, when that happens, Prometheus returns wrong information about disks usage and online/offline status. The commit fixes the issue by avoiding relying on xl.endpoints since it is not ordered.	5 years ago
Klaus Post	ee9077db7d	fix: windows tests for all cases (#9594 ) Replaces #9299	5 years ago
Harshavardhana	6ac48a65cb	fix: use unused cacheMetrics code in prometheus (#9588 ) remove all other unusued/deadcode	5 years ago
Anis Elleuch	6885c72f32	disable check for DirectIO in standalone FS mode (#9558 )	5 years ago
Harshavardhana	2dc46cb153	Report correct error when O_DIRECT is not supported (#9545 ) fixes #9537	5 years ago
Harshavardhana	fea4a1e68e	fix logical error in path length handling for windows (#9520 ) fixes #9515	5 years ago
Anis Elleuch	3e063cca5c	Show the cause error in startup when directio is not supported (#9497 ) This commit tries to create a file using direct i/o in the startup so the server returns quickly and avoid cryptic other errors.	5 years ago
Harshavardhana	ab77b216d1	fix: remove restrictions on windows for NAME_MAX (#9469 ) Fixes #9393	5 years ago
Harshavardhana	498389123e	avoid unnecessary logging on fresh/newly replaced drives (#9470 ) data usage tracker and crawler seem to be logging non-actionable information on console, which is not useful and is fixed on its own in almost all deployments, lets keep this logging to minimal.	5 years ago
Anis Elleuch	c434dff0a4	posix: Add missing error return in RenameFile() (#9319 ) Although it should not happen in most cases.	5 years ago

1 2

86 Commits (af9cb5f5f2f64bbf844db55ff70e9c7efe0d47f3)