BRHosting Blog

News, tutorials, and infrastructure insights from our engineering team

Liquid Cooling Strategies for High-Density GPU Racks
Server Administration

Comparing direct-to-chip, immersion, and rear-door liquid cooling approaches for modern high-density GPU server deployments.

BRHosting Team, May 18, 2024
Building GPU Clusters for Large Language Model Training
Server Administration

A comprehensive guide to designing and deploying GPU clusters optimized for training large language models at scale.

BRHosting Team, Jan 12, 2024
NVMe over Fabrics: High-Performance Shared Storage for Modern Data Centers
Server Administration

NVMe over Fabrics delivers near-local NVMe performance across data center networks, enabling disaggregated architectures built on fast shared storage.

BRHosting Team, Apr 11, 2023
AI Infrastructure: GPU Clusters and the Hardware Behind Machine Learning
Server Administration

AI and machine learning workloads demand specialized infrastructure including GPU clusters, advanced cooling, high-speed interconnects, and tiered storage architectures.

BRHosting Team, Apr 18, 2022
Ceph Storage: Building a Unified Distributed Storage Cluster
Server Administration

Ceph provides scalable, self-healing distributed storage for block, object, and file workloads, offering a cost-effective alternative to proprietary storage solutions.

BRHosting Team, Apr 17, 2020
Automating Server Hardening with Ansible Playbooks
Server Administration

Ansible playbooks automate server hardening by codifying security baselines into repeatable, auditable configurations that ensure consistent protection across server fleets.

BRHosting Team, Sep 18, 2019
Implementing Prometheus Alerting Rules for Proactive Incident Detection
Server Administration

Design effective Prometheus alerting rules that focus on symptoms, use appropriate thresholds, and link to documented runbooks for faster incident response.

BRHosting Team, Feb 15, 2019
Configuring NTP Synchronization Across Distributed Server Fleets
Server Administration

Ensure accurate time synchronization across your server fleet with properly designed NTP hierarchies and monitoring.

BRHosting Team, Apr 15, 2018
Implementing IPMI and Out-of-Band Server Management
Server Administration

Deploy and secure IPMI out-of-band management with network isolation, strong authentication, and modern BMC platforms.

BRHosting Team, Jun 08, 2017