minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	572b1721b2	set max API requests automatically based on RAM (#10421 )	4 years ago
Harshavardhana	b0e1d4ce78	re-attach offline drive after new drive replacement (#10416 ) inconsistent drive healing when one of the drive is offline while a new drive was replaced, this change is to ensure that we can add the offline drive back into the mix by healing it again.	4 years ago
Harshavardhana	eb19c8af40	Bump response header timeout for proxying list request (#10420 )	4 years ago
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	4 years ago
poornas	0037951b6e	improve error message when remote target missing (#10412 )	4 years ago
Andreas Auernhammer	fbd1c5f51a	certs: refactor cert manager to support multiple certificates (#10207 ) This commit refactors the certificate management implementation in the `certs` package such that multiple certificates can be specified at the same time. Therefore, the following layout of the `certs/` directory is expected: ``` certs/ │ ├─ public.crt ├─ private.key ├─ CAs/ // CAs directory is ignored │ │ │ ... │ ├─ example.com/ │ │ │ ├─ public.crt │ └─ private.key └─ foobar.org/ │ ├─ public.crt └─ private.key ... ``` However, directory names like `example.com` are just for human readability/organization and don't have any meaning w.r.t whether a particular certificate is served or not. This decision is made based on the SNI sent by the client and the SAN of the certificate. *** The `Manager` will pick a certificate based on the client trying to establish a TLS connection. In particular, it looks at the client hello (i.e. SNI) to determine which host the client tries to access. If the manager can find a certificate that matches the SNI it returns this certificate to the client. However, the client may choose to not send an SNI or tries to access a server directly via IP (`https://<ip>:<port>`). In this case, we cannot use the SNI to determine which certificate to serve. However, we also should not pick "the first" certificate that would be accepted by the client (based on crypto. parameters - like a signature algorithm) because it may be an internal certificate that contains internal hostnames. We would disclose internal infrastructure details doing so. Therefore, the `Manager` returns the "default" certificate when the client does not specify an SNI. The default certificate the top-level `public.crt` - i.e. `certs/public.crt`. This approach has some consequences: - It's the operator's responsibility to ensure that the top-level `public.crt` does not disclose any information (i.e. hostnames) that are not publicly visible. However, this was the case in the past already. - Any other `public.crt` - except for the top-level one - must not contain any IP SAN. The reason for this restriction is that the Manager cannot match a SNI to an IP b/c the SNI is the server host name. The entire purpose of SNI is to indicate which host the client tries to connect to when multiple hosts run on the same IP. So, a client will not set the SNI to an IP. If we would allow IP SANs in a lower-level `public.crt` a user would expect that it is possible to connect to MinIO directly via IP address and that the MinIO server would pick "the right" certificate. However, the MinIO server cannot determine which certificate to serve, and therefore always picks the "default" one. This may lead to all sorts of confusing errors like: "It works if I use `https:instance.minio.local` but not when I use `https://10.0.2.1`. These consequences/limitations should be pointed out / explained in our docs in an appropriate way. However, the support for multiple certificates should not have any impact on how deployment with a single certificate function today. Co-authored-by: Harshavardhana <harsha@minio.io>	4 years ago
Harshavardhana	1c6781757c	add missing ListBucketVersions from policy actions (#10414 )	4 years ago
Harshavardhana	b4e3956e69	update KES docs to talk about 'mc encrypt' command (#10400 ) add a deprecation notice for KMS_AUTO_ENCRYPTION	4 years ago
Harshavardhana	8a291e1dc0	Cluster healthcheck improvements (#10408 ) - do not fail the healthcheck if heal status was not obtained from one of the nodes, if many nodes fail then report this as a catastrophic error. - add "x-minio-write-quorum" value to match the write tolerance supported by server. - admin info now states if a drive is healing where madmin.Disk.Healing is set to true and madmin.Disk.State is "ok"	4 years ago
Klaus Post	650dccfa9e	cache: Only start at high watermark (#10403 ) Currently, cache purges are triggered as soon as the low watermark is exceeded. To reduce IO this should only be done when reaching the high watermark. This simplifies checks and reduces all calls for a GC to go through `dcache.diskSpaceAvailable(size)`. While a comment claims that `dcache.triggerGC <- struct{}{}` was non-blocking I don't see how that was possible. Instead, we add a 1 size to the queue channel and use channel semantics to avoid blocking when a GC has already been requested. `bytesToClear` now takes the high watermark into account to it will not request any bytes to be cleared until that is reached.	4 years ago
Andreas Auernhammer	9a703befe6	crypto: reduce retry delay when retrying KES requests (#10394 ) This commit reduces the retry delay when retrying a request to a KES server by: - reducing the max. jitter delay from 3s to 1.5s - skipping the random delay when there are more KES endpoints available. If there are more KES endpoints we can directly retry to the request by sending it to the next endpoint - as pointed out by @krishnasrinivas	4 years ago
Klaus Post	9a1615768d	Fix flaky TestXLStorageVerifyFile (#10398 ) `TestXLStorageVerifyFile` would fail 1 in 256 if the first random character was 'a'. Instead write 256 bytes which has 1 in 256^256 probability.	4 years ago
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	4 years ago
Harshavardhana	2acb530ccd	update rulesguard with new rules (#10392 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Praveen raj Mani <praveen@minio.io>	4 years ago
Klaus Post	3e1fb17b70	heal: Check for truncated files (#10399 ) When checking parts we already do a stat for each part. Since we have the on disk size check if it is at least what we expect. When checking metadata check if metadata is 0 bytes.	4 years ago
Klaus Post	a89d6b8e3d	Fix common Windows failure (#10397 ) The `getNonLoopBackIP` may grab an IP from an interface that doesn't allow binding (on Windows), so this test consistently fails. We exclude that specific error.	4 years ago
Klaus Post	1c085f7d1a	Fix crash on Windows when crawling (#10385 ) * readDirN: Check if file is directory `syscall.FindNextFile` crashes if the handle is a file. `errFileNotFound` matches 'unix' functionality: `d19b434ffc/cmd/os-readdir_unix.go (L106)` Fixes #10384	4 years ago
Harshavardhana	4b6585d249	support 'ldap:user' variable replacement properly (#10391 ) also update `ldap.go` examples with latest minio-go changes Fixes #10367	4 years ago
Harshavardhana	9ffad7fceb	discard empty endpoint in crypto kes introduced in `18725679c4`	4 years ago
Andreas Auernhammer	18725679c4	crypto: allow multiple KES endpoints (#10383 ) This commit addresses a maintenance / automation problem when MinIO-KES is deployed on bare-metal. In orchestrated env. the orchestrator (K8S) will make sure that `n` KES servers (IPs) are available via the same DNS name. There it is sufficient to provide just one endpoint.	4 years ago
Anis Elleuch	ba8a8ad818	ListObjectsV1 requests unnecessarily fail with offline nodes (#10386 ) ListObjectsV1 requests are actually redirected to a specific node, depending on the bucket name. The purpose of this behavior was to optimize listing. However, the current code sends a Bad Gateway error if the target node is offline, which is a bad behavior because it means that the list request will fail, although this is unnecessary since we can still use the current node to list as well (the default behavior without using proxying optimization) Currently, you can see mint fails when there is one offline node, after this PR, mint will always succeed.	4 years ago
Harshavardhana	102ad60dee	simplify removing temporary files (#10389 )	4 years ago
Gaige B Paulsen	859ef52886	update for smartos build (solaris too) (#10378 )	4 years ago
Harshavardhana	e730da1438	fix: referesh JWKS public keys upon failure (#10368 ) fixes #10359	4 years ago
Anis Elleuch	46ee8659b4	fix write quorum calculation for bucket operations (#10364 ) When the number of disks is odd, the calculation of quorum for bucket operations were not correct, fix it.	4 years ago
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	4 years ago
Jorge Israel Peña	0a2e6d58a5	hdfs gateway handle listing single files (#10362 )	4 years ago
Klaus Post	1b119557c2	getDisksInfo: Attribute failed disks to correct endpoint (#10360 ) If DiskInfo calls failed the information returned was used anyway resulting in no endpoint being set. This would make the drive be attributed to the local system since `disk.Endpoint == disk.DrivePath` in that case. Instead, if the call fails record the endpoint and the error only.	4 years ago
Harshavardhana	7778fef6bb	update continous heal metrics appropriately for scanned items (#10352 ) bonus make sure to ignore objectNotFound, and versionNotFound errors properly at all layers, since HealObjects() returns objectNotFound error if the bucket or prefix is empty.	4 years ago
飞雪无情	ea1803417f	Use constants for gateway names to avoid bugs caused by spelling. (#10355 )	4 years ago
Harshavardhana	d19b434ffc	fix: bring back delayed leaf detection in listing (#10346 )	4 years ago
Klaus Post	17a1eda702	Disregard healing disks in crawling (#10349 ) When crawling never use a disk we know is healing. Most of the change involves keeping track of the original endpoint on xlStorage and this also fixes DiskInfo.Endpoint never being populated. Heal master will print `data-crawl: Disk "http://localhost:9001/data/mindev/data2/xl1" is Healing, skipping` once on a cycle (no more often than every 5m).	4 years ago
Daniel Valdivia	7d1734d033	indicate through HTTP header cluster healing in progress (#10342 )	4 years ago
Harshavardhana	03ec6adfd0	fix: KES http2.0 communication support (#10341 )	4 years ago
Harshavardhana	309b10f201	keep crawler cycle at 5 minutes	4 years ago
Klaus Post	c097ce9c32	continous healing based on crawler (#10103 ) Design: https://gist.github.com/klauspost/792fe25c315caf1dd15c8e79df124914	4 years ago
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	4 years ago
Klaus Post	bc2ebe0021	Only enforce quota on success (#10339 ) We should only enforce quotas if no error has been returned. firstErr is safe to access since all goroutines have exited at this point. If `firstErr` hasn't been set by something else return the context error if cancelled.	4 years ago
Harshavardhana	11aa393ba7	Allow region errors to be dynamic (#10323 ) remove other FIXMEs as we are not planning to fix these, instead we will add dynamism case by case basis. fixes #10250	4 years ago
Praveen raj Mani	d0c910a6f3	Support https and basic-auth for elasticsearch notification target (#10332 )	4 years ago
kannappanr	d15a5ad4cc	S3 Gateway: Check for encryption headers properly (#10309 )	4 years ago
Harshavardhana	95411228db	add missing cleanupStaleMultipartUploads (#10325 ) fixes #10319	4 years ago
ebozduman	23774353b7	get_object() returns NoSuchKey error when object is a prefix (#10315 )	4 years ago
poornas	a2a5ec93d3	fix: use global context for filling cache in the background (#10308 )	4 years ago
Harshavardhana	27a774cbe9	fix: FS mode should reject putBucketVersioning (#10307 )	4 years ago
Klaus Post	8e6787a302	Fix TestDataUpdateTracker hanging (#10302 ) Keep dataUpdateTracker while goroutine is starting. This will ensure the object is updated one `start` returns Tested with ``` λ go test -cpu=1,2,4,8 -test.run TestDataUpdateTracker -count=1000 PASS ok github.com/minio/minio/cmd 8.913s ``` Fixes #10295	4 years ago
Harshavardhana	59352d0ac2	load all blocking metadata in background (#10298 ) most of this metadata already has fallbacks and there is no good reason to load them in blocking fashion	4 years ago
Harshavardhana	75d44b3bae	add disk for more context in bitrot errors (#10296 )	4 years ago
Klaus Post	95ae6c4b49	Fix missing unlock in *healSequence.hasEnded() (#10305 ) The background healing sequence would always hang when this function is called.	4 years ago
KevinSmile	0ebb73ee2e	use const instead of literals (#10292 )	4 years ago

... 3 4 5 6 7 ...

3053 Commits (267d7bf0a9f114065314a0b2863f7fcc9e923012)