Logging, Monitoring, and Observability in Google Cloud

Icono Duración del curso
Duración del curso
21 horas
Lugar de impartición
Madrid / Online
Icono modalidad del curso
Modalidad
Aula Virtual, Presencial
Icono Fecha del curso
Fecha de inicio
Próximamente
 

Acerca del curso

Learn how to monitor, troubleshoot, and improve your infrastructure and application performance. Guided by the principles of Site Reliability Engineering (SRE), this course features a combination of lectures, demos, hands-on labs, and real-world case studies. In this course, you’ll gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, and profiling CPU and memory usage.

Próximas convocatorias

Estamos preparando nuevas convocatorias.
Déjanos tus datos a través del formulario y te avisaremos lo antes posible.

  • Plan and implement a well-architected logging and monitoring infrastructure
  • Define service level indicators (SLIs) and service level objectives (SLOs)
  • Create effective monitoring dashboards and alerts
  • Monitor, troubleshoot, and improve Google Cloud infrastructure
  • Analyze and export Google Cloud audit logs
  • Find production code defects, identify bottlenecks, and improve performance
  • Optimize monitoring costs

  • Google Cloud Platform Fundamentals: Core Infrastructure or equivalent experience
  • Basic scripting or coding familiarity
  • Proficiency with command-line tools and Linux operating
    system environments

Documentación Oficial de Google Cloud - Logging, Monitoring, and Observability in Google Cloud

  • Formador Certificado por CAS Training
  • Más de 5 años de experiencia profesional
  • Más de 4 años de experiencia docente
  • Profesional activo en empresas del sector IT

Modulo 1: Introduction to Google Cloud Monitoring Tools

  • Understand the purpose and capabilities of Google Cloud operations-focused
    components: Logging, Monitoring, Error Reporting, and Service Monitoring
  • Understand the purpose and capabilities of Google Cloud application performance management focused components: Debugger, Trace, and Profiler

Modulo 2: Avoiding Customer Pain

  • Construct a monitoring base on the four golden signals: latency, traffic, errors,
    and saturation
  • Measure customer pain with SLIs
  • Define critical performance measures
  • Create and use SLOs and SLAs
  • Achieve developer and operation harmony with error budgets

Modulo 3: Alerting Policies

  • Develop alerting strategies
  • Define alerting policies
  • Add notification channels
  • Identify types of alerts and common uses for each
  • Construct and alert on resource groups
  • Manage alerting policies programmatically

Modulo 4: Monitoring Critical Systems

  • Choose best practice monitoring project architectures
  • Differentiate Cloud IAM roles for monitoring
  • Use the default dashboards appropriately
  • Build custom dashboards to show resource consumption and application load
  • Define uptime checks to track aliveness and latency

Modulo 5: Configuring Google Cloud Services for Observability

  • Integrate logging and monitoring agents into Compute Engine VMs and images
  • Enable and use Kubernetes Monitoring
  • Extend and clarify Kubernetes monitoring with Prometheus
  • Expose custom metrics through code and with the help of OpenCensus

Modulo 6: Advanced Logging and Analysis

  • Identify and choose among resource tagging approaches
  • Define log sinks (inclusion filters) and exclusion filters
  • Create metrics based on logs
  • Define custom metrics
  • Use Error Reporting to link application errors to Logging
  • Export logs to BigQuery

Modulo 7: Monitoring Network Security and Audit Logs

  • Collect and analyze VPC Flow logs and Firewall Rules logs
  • Enable and monitor Packet Mirroring
  • Explain the capabilities of Network Intelligence Center.
  • Use Admin Activity audit logs to track changes to the configuration or metadata of resources
  • Use Data Access audit logs to track accesses or changes to user-provided resource data
  • Use System Event audit logs to track GCP administrative actions

Modulo 8: Managing Incidents

  • Define incident management roles and communication channels
  • Mitigate incident impact
  • Troubleshoot root causes
  • Resolve incidents
  • Document incidents in a post-mortem process

Modulo 9: Monitoring Network Security and Audit Logs

  • Collect and analyze VPC Flow logs and Firewall Rules logs
  • Enable and monitor Packet Mirroring
  • Explain the capabilities of Network Intelligence Center
  • Use Admin Activity audit logs to track changes to the configuration or metadata of resources
  • Use Data Access audit logs to track accesses or changes to user-provided resource data
  • Use System Event audit logs to track GCP administrative actions

Modulo 10: Optimizing Stackdriver Costs

  • Understand Stackdriver billing
  • Analyze Stackdriver resource utilization
  • Implement best practices for Stackdriver cost control

Solicita Información


CAS TRAINING, S.L.U. , le informa que la finalidad del tratamiento es atender a su solicitud de información, reclamación, duda o sugerencia que realice sobre los productos y/o servicios ofrecidos, así como para mantenerle informado de nuestra actividad la gestión de la relación que nos une, la prestación del servicio contratado, así como el envío de información que pudiera ser de su interés sobre nuestros servicios formativos y de consultoría de negocio.

Podrá retirar su consentimiento y ejercitar los derechos reconocidos en los artículos 15 a 22 del Reglamento (UE) 2016/679, enviando un correo electrónico a rgpd@cas-training.com, adjuntando copia de su DNI o documentación acreditativa de su identidad. Puede solicitar más información rgpd@cas-training.com o www.cas-training.com.

Programa del curso:
Descargar programa en PDF
Compartir: