• DevOps
    Case Study

    How we built a resilient multi-account, multi-cloud solution for a Health Tech service provider!

    READ CASESTUDY
    icon

    24/7 DevOps as a Service

    Round-the-clock DevOps for uninterrupted efficiency.

    icon

    Infrastructure as a Code

    Crafting infrastructure with ingenious code.

    icon

    CI/CD Pipeline

    Automated CI/CD pipeline for seamless deployments.

    icon

    DevSecOps

    Integrated security in continuous DevOps practices.

    icon

    Hire DevOps Engineers

    Level up your team with DevOps visionaries.

    icon

    Consulting Services

    Navigate success with expert DevOps consulting.

  • TechOps
    Case Study

    How we built a scalable Odoo solution for a Travel Tech service provider!

    READ CASESTUDY

    WEB HOSTING SUPPORT

    icon

    HelpDesk Support

    Highly skilled 24/7 HelpDesk Support

    icon

    Product Support

    Boost your product support with our expertise.

    MANAGED SERVICES

    icon

    Server Management

    Don’t let server issues slow you down. Let us manage them for you.

    icon

    Server Monitoring

    Safeguard your server health with our comprehensive monitoring solutions.

    STAFF AUGMENTATION

    icon

    Hire an Admin

    Transform your business operations with our expert administrative support.

    icon

    Hire a Team

    Augment your workforce with highly skilled professionals from our diverse talent pool.

  • CloudOps
    Case Study

    How we helped a Private Deemed University in India, save US $3500/m on hosting charges!

    READ CASESTUDY
    icon

    AWS Well Architected Review

    Round-the-clock for uninterrupted efficiency

    icon

    Optimize

    Efficient CloudOps mastery for seamless cloud management

    icon

    Manage

    Automated CI/CD pipeline for seamless deployments

    icon

    Migrate

    Upgrade the journey, Migrate & Modernize seamlessly

    icon

    Modernize

    Simplify compliance complexities with our dedicated services

    icon

    FinOps as a Service

    FinOps as a Service

  • SecOps
    Case Study

    How we built a scalable Odoo solution for TravelTech service provider!

    READ CASESTUDY
    icon

    VAPT

    Vulnerability Assessment and Penetration Testing

    icon

    Source Code Review

    Ensuring source code security ans safe practices to reduce risks

    icon

    Security Consultation

    On demand services for improving server security

    icon

    System Hardening

    Reduced vulnerability and proactive protection

    icon

    Managed SoC

    Monitors and maintains system security. Quick response on incidents.

    icon

    Compliance as a Service

    Regulatory compliance, reduced risk

  • Insights
    Case Study

    How we helped a Private Deemed University in India, save US $3,500/m on hosting charges!

    READ CASESTUDY
    icon

    Blog

    Explore our latest articles and insights

    icon

    Case Studies

    Read about our client success stories

    icon

    Flipbook

    Explore our latest Flipbook

    icon

    Events

    Join us at upcoming events and conferences

    icon

    Webinars

    Watch our educational webinar series

  • Our Story
  • Contact Us

Interested to collaborate?

Get in touch with us!

Ready to elevate your business with certified cloud expertise? Contact us today to learn how our team can help you leverage cloud technology to drive growth, streamline operations, and enhance security.

  • AWSAWS
  • Azure CloudAzure Cloud
  • Google CloudGoogle Cloud
  • Akamai CloudAkamai Cloud
  • OVHOVH
  • Digital OceanDigital Ocean
  • HetznerHetzner
  • Kubernetes Consultancy Services
  • K8s & Cloud native Solutions
  • 24/7 Infrastructure Monitoring
  • DevOps as a Service
  • Cloud CI/CD Solutions
  • White Labeled MSP Support
  • Our story
  • Life@SupportSages
  • Insights
  • Careers
  • Events
  • Contact Us

Connect with us!


LinkedInFacebookXInstagramYouTube

aws partneraws advanced partner
SupportSages

Copyright © 2008 – 2026 SupportSages Pvt Ltd. All Rights Reserved.
Privacy PolicyLegal TermsData ProtectionCookie Policy
How to troubleshoot an issue – The sysadmin way

How to troubleshoot an issue – The sysadmin way

Nidhin George

  • 5 min read
How to troubleshoot an issue – The sysadmin way

Generating audio, please wait...

For me, a sysadmin is a jack of all, master of few trades. A good sysadmin can play multiple roles effortlessly, in his professional as well as personal life. Once he is into debugging an issue, don’t disturb. That’s the only condition he would have.

I agree that its difficult to document how a sysadmin should trouble shoot the issue in hand. But this is a humble effort to do so and is based on the Master Class videos of Brendan Gregg, available in YouTube.

When you have an issue in hand, what is the first step you would take ?

How did you come to know about the issue ? Was it your customer or users that reported the issue to you verbally ? Or was it reported by your monitoring system ? Let me de-route here a bit. If the issue was not reported by your monitoring system, it is your responsibility to identify the root cause of the the issue and monitor it, so that you get alerted, before the user gets it next time.

If the issue was reported by the user, don’t take it for granted. You must understand the issue from a sysadmin point of view. For example, if a user claims that the server is down, don’t take it for granted and reboot the server. Try to understand why he thinks his server is down, most probably his website will be down, which require a web server restart or it could be even his database server. Before restarting the services please make sure that his public IP is not blocked in the server firewall.

You should duplicate the issue first and understand the root cause then, solve it. You may need temporary solutions or workarounds. But it is better not to settle for temporary and always go for permanent solution.

Duplication is successful. What’s next ?

You need the right tools with you to take this forward in order to troubleshoot the issue. Not just having the tools is enough, but where, when and how to use it is also important. In the above mentioned case, when the customer says the server is down, you were able to duplicate the issue and found that his website is not working while his email server is working. That’s when you need to know how the systems work.

You will have to use the dig/host/nslookup to see whether the website and the mail server are pointing to the same server. You should have a mental flowchart based on elimination.

If they are on the server, flow of your thoughts should be that are you able to ping and then ssh to the server. Now what if the ssh fails. If SSH fails, what is the error message. Is it “Connection refused” or “Connection reset by peer” or “Connection timed out”. Each of these error messages, should take you to different flow and finally you will be able to solve it.

Promotional banner

What could be the reasons and how would I find it ?

Always check for the error logs. Almost all the services will have an option to enable the debug or error log level, ranging from errors to information type messages. Irrespective of whether it is an error with hardware or software, the errors are logged. All you have to do is to find the location where the errors are logged and demystifying the errors shown.

Not just logging is enough, but the error reporting in those logs should be monitored on a regular basis. Unless there is monitoring, you will be waiting for the users to report the errors and that results in bad reputation.

I found the issue and solved it. What’s next ?

First and foremost you should convince or analyze yourself whether the solution you have proposed or put together is temporary or permanent. It should be a permanent fix and shouldn’t repeat. If you fear that it may happen again, the problem is not solved. Work towards a permanent solution which will make sure that the issue wont repeat and if it ever happens, you should be the first to know and solve it before the issue elevates itself to critical level.

Document the issue and the solution. You may have solved it now, but what if the issue repeats in another office network, say, after 3 years. You may have a vague memory that you have seen this issue somewhere, but would have forgotten by now on how you solved it. You will have to reinvent the wheel. That is why documentation is important so that you can come back any time and solve the issues faster.

This also mean that a good sysadmin will make sure that the issue doesn’t repeat on the same server and he will catch it before it happens. However not every sysadmin does, because a sysadmin by nature is a lazy one. But the genius yet lazy sysadmin does it. He documents everything, so that when he is not around, other sysadmins can solve it, and get things moving without disturbing his sleep.

A good sysadmin believes that “prevention is better than cure” and that “a stitch in time saves nine!”

Get 24/7 technical helpdesk support

  • server
Promotional banner
Promotional banner

ASSP – A Good Open Source and Cheap Alternative to Commercial Spam Fighting Solutions

ASSP – A Good Open Source and Cheap Alternative to Commercial Spam Fighting Solutions
  • Email service
  • Security
logo

PHP-FPM Vulnerability (CVE-2019-11043) can lead to Remote Code Execution on nginx

SupportSages Logo
  • Linux
  • nginx
  • PHP
  • Vulnerabilities
logo

Should you run SSH on alternate ports ?

Should you run SSH on alternate ports ?
  • Security
  • server
logo

WordPress Websites Showing 500 Internal Server Error – Causes and Solutions

WordPress Websites Showing 500 Internal Server Error – Causes and Solutions
  • server
  • WordPress
logo

Posts by Nidhin George

Nidhin has started his career as a Linux System Administrator since 2013 and holds vast experience in server administration, CMS management and cPanel systems. Apart from this he's a movie buff and have a huge collection of movies.