[ 
https://issues.apache.org/jira/browse/AIRFLOW-4922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394442#comment-17394442
 ] 

ASF GitHub Bot commented on AIRFLOW-4922:
-----------------------------------------

xuemengran commented on pull request #6722:
URL: https://github.com/apache/airflow/pull/6722#issuecomment-893956847


   In version 2.1.1, I tried modifying the 
airflow/utils/log/file_task_handler.py file to obtain the hostname by reading 
the log table. 
   Debugging confirmed that the hostname can be retrieved this way, but a 
bigger problem appeared. 
   The task is marked as successful without being scheduled, and the log still 
cannot be viewed. I therefore conclude that, to solve this problem, the 
hostname must be written to the task_instance table before the task is 
executed. 
   I think this bug is very important because it directly affects the use of 
Airflow in distributed scenarios. Please fix it as soon as possible.
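
   To illustrate the point about writing the hostname before execution, here 
is a minimal, self-contained sketch. It is not Airflow's code: it uses an 
in-memory SQLite table as a stand-in for task_instance, and the names 
make_db, stored_hostname, build_log_url, run_task and crash are hypothetical 
helpers invented for this example. It only assumes that the log URL is built 
from the stored hostname and port 8793, as the error message in the issue 
suggests.

```python
import sqlite3

LOG_PORT = 8793  # worker log-server port that appears in the reported error


def make_db():
    """Create an in-memory stand-in for the task_instance table (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE task_instance ("
        "task_id TEXT PRIMARY KEY, hostname TEXT DEFAULT '', state TEXT)"
    )
    conn.execute(
        "INSERT INTO task_instance (task_id, state) VALUES ('my_task', 'queued')"
    )
    conn.commit()
    return conn


def stored_hostname(conn, task_id):
    """Return the hostname currently committed for the task."""
    row = conn.execute(
        "SELECT hostname FROM task_instance WHERE task_id = ?", (task_id,)
    ).fetchone()
    return row[0]


def build_log_url(hostname, rel_path):
    """Build the URL used to fetch a log file from a worker."""
    return f"http://{hostname}:{LOG_PORT}/log/{rel_path}"


def run_task(conn, task_id, hostname, work, commit_hostname_first):
    """Hypothetical task runner; `work` is the task body and may raise."""
    if commit_hostname_first:
        # Proposed behaviour: record the host before the task body runs.
        conn.execute(
            "UPDATE task_instance SET hostname = ?, state = 'running' "
            "WHERE task_id = ?",
            (hostname, task_id),
        )
        conn.commit()
    work()  # if this raises, the update below is never reached
    conn.execute(
        "UPDATE task_instance SET hostname = ?, state = 'success' "
        "WHERE task_id = ?",
        (hostname, task_id),
    )
    conn.commit()


def crash():
    """Simulate a task that dies before finishing."""
    raise RuntimeError("simulated task crash")


if __name__ == "__main__":
    for early in (False, True):
        conn = make_db()
        try:
            run_task(conn, "my_task", "worker-1", crash, commit_hostname_first=early)
        except RuntimeError:
            pass
        print(early, build_log_url(stored_hostname(conn, "my_task"), "my_dag/my_task/1.log"))
```

   With commit_hostname_first=False the crashed task leaves the hostname empty, 
so the URL has no host, which matches the "No host supplied" error. With 
commit_hostname_first=True the hostname survives the crash and the URL is 
usable.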




> If a task crashes, host name is not committed to the database so logs aren't 
> able to be seen in the UI
> ------------------------------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-4922
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-4922
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: logging
>    Affects Versions: 1.10.3
>            Reporter: Andrew Harmon
>            Assignee: wanghong-T
>            Priority: Major
>
> Sometimes when a task fails, the log shows the following:
> {code}
> *** Log file does not exist: /usr/local/airflow/logs/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log
> *** Fetching from: http://:8793/log/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log
> *** Failed to fetch log file from worker. Invalid URL 'http://:8793/log/my_dag/my_task/2019-07-07T09:00:00+00:00/1.log': No host supplied
> {code}
> I believe this happens because the row is not committed to the database 
> until after the task finishes. 
> https://github.com/apache/airflow/blob/a1f9d9a03faecbb4ab52def2735e374b2e88b2b9/airflow/models/taskinstance.py#L857



